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Introduction: 


The  purpose  of  my  study  is  to  obtain  a  collective  knowledge  of  proteases  which  activity  is 
differentially  overexpressed  during  breast  cancer  progression  using  phage  display  technology.  The 
profile  of  proteases  or  "protease  fingerprint"  will  then  be  validated  as  a  diagnostic  tool  for  predicting 
breast  cancer  progression. 

Body: 

Proteases  are  key  regulators  of  a  wide  range  of  physiological  processes  (1,2),  and  are  recognized  as 
important  and  tractable  drug  targets.  The  activity  of  proteases  is  tightly  regulated  by  inhibitors,  and 
some  of  them  by  protein  activation.  Many  proteases  such  as  matrix-metalloproteinase  are  synthesized 
as  zymogen  (inactive  form)  and  are  activated  later  by  other  proteases  (3-5).  Upon  activation,  protease 
activity  is  then  regulated  by  either  the  non-specific  inhibitors  such  as  a  1 -macroglobulin  or  specific 
inhibitors  such  as  tissue  specific  metalloproteinase  inhibitors  (TIMPs)  for  MMPs  (6).  Mistakes  in  the 
regulation  of  protease  activity  can  lead  to  various  pathological  conditions  such  as  cancer  growth, 
tumor  metastasis,  inflammation,  cardiovascular,  and  autoimmune  diseases  (7-11).  Consequently, 
information  of  protease  activity  in  diseases  and  progression  of  diseases  will  be  important  in 
discovering  biomarkers  and  potential  therapeutic  targets.  In  this  fellowship  proposal,  I  hypothesized 
that  a  profile  of  protease  activity  can  be  used  as  a  diagnostic  tool  to  predict  breast  cancer  progression. 
Three  aims  were  designed  in  this  fellowship  proposal  to  test  this  hypothesis.  So  far,  studies  toward 
completing  these  three  aims  have  lead  to  three  publications  that  I  am  either  listed  as  the  first  or  the 
second  author.  The  manuscripts  are  included  in  this  annual  summary  repot. 

Aim#l.  Construct  A  Substrate  Phage  Display  Library  To  Use  As  A  Tool  For  Studying  Protease 
Fingerprint. 

I  aimed  in  the  fellowship  proposal  to  profile  protease  activity  in  vivo  using  substrate  phage  display 
library.  Construction  of  the  substrate  phage  library  was  reported  in  the  annual  summary  report  last 
year.  To  demonstrate  that  this  substrate  phage  display  library  can  be  used  to  distinguish  catalytic 
activity  of  proteases,  I  used  the  substrate  phage  display  library  to  distinguish  the  catalytic  two  cancer- 
related  proteases,  Metalloproteinase-2  (MMP-2)  and  9  (MMP-9).  MMP-2  and  9  belong  to  a  subfamily 
of  matrix  metalloproteinases  that  is  historically  referred  as  gelatinases.  They  are  highly  homologous 
in  their  overall  structure  and  catalytic  domains.  These  two  proteases  have  received  great  attention 
continuously  because  their  involvement  in  tumor  progression  and  angiogenesis  (8,12-14)  and  are  of 
interest  as  pharmaceutical  targets.  Up  to  now,  no  functional  assay  is  available  to  differentiate  their 
catalytic  activity.  Hence,  the  study  of  MMP-2  and  MMP-9  substrate  profile  will  demonstrate  the 
power  of  substrate  phage  display  library  to  discriminate  highly  similar  proteases.  Additionally,  new 
understanding  of  MMP-2  and  MMP-9  substrate  recognition  profile  will  provide  information  of  distinct 
substrate  recognition  to  aid  the  development  of  a  functional  assay  that  leads  to  uncover  their  functions 
in  the  normal  and  pathological  processes. 

Result  L  Four  Groups  Of  MMP-2  Substrates  Were  Found  Via  Substrate  Phage  Selection 

After  two  rounds  of  substrate  selections  of  MMP-2,  thirty  individual  clones  were  selected  for 
sequencing  based  on  the  fact  that  they  were  cleaved  by  more  than  25%  when  incubated  for  two  hours 


3 


with  activated  MMP-2.  Four  distinct  groups  of  substrates  are  found  among  these  clones.  The  first 
group  of  substrate  contains  the  P-X-X-XHy  motif,  where  Xhv  represents  a  large  hydrophobic  residue. 
This  motif  appears  to  be  a  substrate  for  a  number  of  different  MMPs  (15-17).  Substrates  in  group  II- 
IV  represent  novel  recognition  motifs  for  MMP-2.  Group  II  substrates  contain  the  I/L-X-X-  Xny 
motif,  in  which  the  last  hydrophobic  residue  is  usually  lie  or  Leu.  Group  III  substrates  contain  the 
XHy— S-X-L  motif,  where  Ser  and  Leu  are  invariant.  Group  IV  substrates  are  comprised  of  the  H-X-X- 
XHy  motif,  which  is  similar  to  the  cleavage  site  for  MMP-2  within  laminin-5.  A  list  of  group  I-IV 
substrate  sequences  can  be  found  in  the  manuscript  "  A  Unique  Substrate  Recognition  Profile  for 
Matrix  Metalloproteinase-2"  included  with  the  annual  summary  report. 


Result  2:  Three  Groups  of  MMP-2  Substrates  Were  Selective  For  MMP-2  over  MMP-9. 


Comparison  of  the  rate  of  hydrolysis  of  a  set  of  representative  phage  substrates  by  MMP-2  and  MMP-9 
using  the  substrate  ELISA  was  made  to  determine  the  selectivity  of  the  MMP-2  substrates.  The  ability 
of  MMP-2  (dark  bars)  and  MMP-9  (open  bars)  to  cleave  substrate  selected  from  the  phage  library  were 
compared  using  the  phage  ELISA  and  illustrated  in  figure  1 .  Results  are  presented  as  the  percentage  of 
hydrolysis  compared  to  non-treated  control  phage.  This  experiment  was  repeated  three  times,  yielding 
nearly  identical  results  in  each  repetition. 


Figure  1. 


Group  I  Group  II  Group  III 

Phage  clone 


With  the  exception  of  one  clone,  A45,  the 
phage  from  group  I  lacked  selectivity  for 
MMP-2  over  MMP-9.  In  contrast  however, 
all  of  the  phage  substrates  from  group  II  and 
III  were  highly  selective  for  MMP-2  over 
MMP-9.  The  extent  of  hydrolysis  of  phage 
substrates  from  group  IV  was  not  compared 
using  this  assay  because  their  rate  of 
hydrolysis  by  MMP-2  was  generally  low. 
These  substrates  were  characterized  in  more 
detailed  with  the  aid  of  synthetic  peptides. 

(This  figure  can  also  be  found  in  the 
manuscript  "  A  Unique  Substrate 
Recognition  Profile  for  Matrix 
Metalloproteinase-2"  included  with  the 
annual  summary  report  as  figure  1 .) 


Result  3j  Selectivitx  QL MMP-2  Substrates  Was  Confirmed  By  Charactering  Synthetic  Peptides, 

Representative  peptides  from  groups  I-IV  were  synthesized  and  used  to  characterized  substrate 
hydrolysis  in  greater  detail.  The  peptides  were  initially  used  to  determine  the  position  of  the  scissile 
bond  with  each  motif  by  analyzing  the  cleavage  products  by  MALDI-TOF  mass  spectrometry.  The  rate 
of  hydrolysis  was  measured  by  kinetic  parameters,  Km,  kcat  and  kcJKm  ratio. 
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In  virtually  all  cases,  the  scissile  bond  directly  precedes  a  large  hydrophobic  residue,  which  is  frequently 
lie  or  Leu.  This  feature  is  consistent  with  the  presence  of  a  deep  binding  pocket  at  the  corresponding  Sr 
subsite  within  MMP-2.  Most  significantly,  substrates  from  group  II,  III,  and  IV  exhibit  kcJKm  ratios 
that  are  eight  to  almost  two  hundred-fold  higher  for  MMP-2  than  MMP-9.  The  selectivity  of  each  of  the 
peptide  substrates  for  MMP-2  compared  to  MMP-9  is  represented  by  dividing  the  kcaJKm  ratio  for 
MMP-2  by  the  same  value  for  the  MMP-9  as  shown  in  the  following  table. 


Peptide 

MMP-2/MMP-9  (Selectivity  Ratio) 

Group  I 
(P-X-X-Xhj.) 

A3 

2 

Group  II 
(I/L-X-X-XHy) 

B49 

14 

B74 

108 

Group  III 
(XHy-S-X-L) 

C9 

198 

A13 

15 

B37 

71 

A34 

Group  IV 
(H-X-X-XHy) 

40 

A21 

8 

Result  4:  Hydrolysis  of  A  Protein  Substrate  Containing,  the  S-X-L  Motif. 

Ultimately,  one  would  hope  to  be  able  to  use  the  substrate  recognition  profiles  obtained  from  substrate 
phage,  and  other  substrate  profiling  strategy  (18,19)  to  generate  hypothesis  regarding  physiologic 
substrates.  As  an  initial  step  in  this  direction,  I  compared  the  ability  of  MMP-2  and  9  to  cleave  Eph 
B1  and  Eph  B2.  Eph  B1  and  B2  are  tyrosine  kinase  receptors  that  are  responsible  for  cell-cell 
signaling  during  neuronal  development.  Only  EphBl  has  a  putative  cleavage  site  of  MMP-2 
corresponding  to  the  motif  of  XHy-S-XlL  and  EphB2  lacks  the  predicted  cleavage.  The  Eph  B1  and 
Eph  B2  fusion  proteins  were  incubated  with  equimolar  amounts  (280  nM)  of  MMP-2  and  MMP-9  for 
four  hours.  The  extent  of  hydrolysis  was  gauged  by  SDS-PAGE  (Figure  2).  The  Eph  Bl-Fc  fusion 
protein  was  almost  quantitatively  cleaved  by  MMP-2  (Figure  2,  lane  2).  The  extent  of  cleavage  by 
MMP-9  was  far  lower  (Figure  2,  Lane  3),  which  is  not  surprising  considering  the  selectivity  exhibited 
by  the  group  II  substrates.  Neither  protease  cleaved  the  Eph  B2  fusion  protein.  These  findings 
illustrate  that  motifs  found  to  be  selective  for  MMP-2  by  the  substrate  phage  display  library,  can  act  as 
selective  substrates  within  the  context  of  whole  proteins. 
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Figure  2. 
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Summary  of  results  in  Aim#l: 

MMP-2  and  9  were  found  to  have  distinct  substrate  recognition  profiles.  Novel  and  highly  selective 
MMP-2  substrate  sequences  were  found  in  this  study.  Data  from  this  study  supports  the  hypothesis 
that  substrate  phage  display  library  can  be  used  to  differentiate  diminutive  structural  difference  of 
proteases.  In  addition,  these  novel  and  highly  selective  MMP-2  substrates  will  be  valuable  in 
developing  in  vivo  activity  assay  for  MMPs  and  mechanistic  design  of  specific  MMP  inhibitors. 

It  is  also  encouraging  to  illustrate  a  new  technology  that  has  a  great  utility  in  revealing  potential 
functions  of  proteases.  To  understand  the  reeling  involvement  of  proteases  in  pathological  conditions, 
one  must  approach  from  the  fundamental  biology  of  the  proteases.  Therefore,  structure-functional 
information  obtained  from  this  study  (detailed  in  the  manuscript  of  “a  unique  substrate  profile 
recognition  of  MMP-2”)  will  help  advancing  our  knowledge  of  the  biology  of  matrix 
metalloproteinases . 

Aim#2.  Charcterize  profile  of  protease  activity  present  in  the  blood  of  mice  bearing  non¬ 
metastatic  breast  cancer  tumors  and  mice  bearing  metastatic  breast  cancer  tumors. 

To  test  whether  substrate  phage  display  library  can  be  use  to  profile  protease  activity  in  vivo,  I 
exposed  phage  to  serum  from  normal  SCID  mice,  mice  bearing  non-metastatic  MDA-MD-435  tumor, 
and  mice  bearing  metastatic  MDA-MD-435  tumor.  No  significant  protease  activity  was  detected  in 
any  samples.  Subsequently,  the  serum  was  concentrated  by  either  acetone  or  TCA  precipitation  to 
increase  the  concentration  of  the  protein  used  in  the  reaction  volume.  Also,  several  experiments  were 
conducted  to  find  out  the  lowest  concentration  of  active  protease  that  can  be  detected  by  phage  display 
library  in  the  presence  of  nature  inhibitors  in  the  serum.  I  found  that  no  protease  activity  could  be 
detected  below  lOOng  of  active  protease  in  the  serum  from  a  normal  mouse.  Hence,  a  range  of  protein 
concentration  from  lOOng  to  lOug  (from  5ml  of  mouse  serum)  was  tested  for  detection  of  protease 
activity  in  serum  samples.  Fair  amount  of  protease  activity  was  only  detected  when  using  the  highest 
amount  of  concentrated  protein. 
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It  was  expected  that  the  amount  of  proteases  present  in  the  samples  might  be  small.  Low  protease 
concentration  poses  a  difficulty  to  drive  the  kinetic  reaction  between  proteases  and  substrates  and 
might  potentially  explain  why  no  significant  protease  activity  was  detected  when  the  serum  was  not 
concentrated.  Although  protease  activity  can  be  detected  in  1  Oug  of  total  protein  from  the  serum,  it 
points  to  the  problem  of  low  sensitivity  when  exposing  phage  display  library  directly  to  biological 
samples.  Furthermore,  the  amount  of  serum  that  would  need  to  screen  and  validate  protease  activity 
poses  another  technical  difficulty  to  pursue  this  method.  Hence,  I  modified  my  approach  of  carrying 
out  the  functional  study  of  disease-related  proteases  and  developed  a  one-pot  phage  selection  system 
that  yield  the  substrate  recognition  profile  of  multiple  purified  proteases  from  a  single  round  of 
selection. 


I  reasoned  that  a  phage  display  based  approach  could  be  modified  to  characterize  the  substrate 
recognition  profile  of  all  proteases.  Phage  display,  when  used  in  the  traditional  manner,  is  a  heritable 
system,  allowing  one  to  evolve  optimal  substrates  (or  binders)  through  multiple  rounds  of  selective 
pressure.  It  is  this  property,  however,  that  typically  prevents  phage  from  being  used  to  obtain  substrate 
recognition  profiles  for  multiple  proteases  at  once.  Here,  I  developed  a  strategy  for  obtaining  the 
substrate  recognition  profile  of  multiple  proteases,  simultaneously.  It  couples  one  round  of  substrate 
phage  display  with  a  substrate  ELISA  to  deconvolute  the  specificity  of  protease  substrates.  The 
system  allows  one  to  avoid  problems  caused  by  the  heritable  nature  of  phage  display  by  activity 
profiling  even  the  rarest  of  substrates  after  a  single  round  of  protease  selection.  Furthermore,  it  allows 
all  possible  substrates  to  be  tested  by  any  proteases  individually.  It  allows  one  to  compare  and 
deconvolute  the  substrate  preferences  for  large  panels  of  proteases  in  single  experiment.  A  diagram  of 
the  "one-pot"  protease  substrate  phage  selection  scheme  is  listed  below  to  illustrate  the  expected 
outcome  from  this  method. 


"One-pot”  phage  selection 


Perform  ELISA  screen  to 
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Protease 
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3 
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000 

••• 
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4 

••• 

••• 

000 
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characterization 


Four  proteases  representing  two  catalytic  classes  were  used  to  test  the  system's  ability  to  discriminate 
the  catalytic  activity  of  each  protease.  Within  this  set,  I  included  two  classical  serine  proteases, 
thrombin  and  factor  Xa,  for  which  the  substrate  specificity  has  been  well  characterized.  A  test  of  the 
system  with  "knowns"  would  indicate  the  ability  of  the  system  to  properly  report  substrate  recognition. 
MMP-9  (matrix  metalloproteinase)  and  atrolysin  C  (snake  venom  metalloproteinase)  were  used  as 
representatives  of  metalloproteinases,  whose  recognition  profiles  are  not  well  established  (20).  Within 
each  class,  the  proteases  share  structural  similarity  and,  reportedly,  have  overlapping  substrate 
preferences.  A  comparison  of  the  substrates  selected  for  each  protease  to  the  reported  substrate 
preference  of  each  enzyme  demonstrates  the  power  of  the  approach.  As  the  result  of  a  single  round  of 
substrate  selection  using  the  mixture  of  these  four  proteases,  peptide  substrates  were  identified  that  i) 
have  high  kcJKm  ration,  if)  are  selective  for  individual  proteases,  and  Hi)  match  the  sequences  of 
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known  physiological  substrates.  Please  refer  to  the  paper  that  I  published  this  year  in  Analytical 
Biochemistry  included  in  this  annual  summary  update  for  more  detail  information. 


Result  1:  Selective  substrate s  were  obtained,  from  a  single  round,  of  "one-pot"  pha^e  selection. 

The  substrate  phage  library  was  incubated  with  a  mixture  of  thrombin  (Th),  factor  Xa  (Xa), 

MMP-9  (M9)  and  atrolysin  C  (At).  Phage  substrates  were  selected  in  one  round  of  selection. 

Individual  phage  clones  were  tested  as  substrates  in  an  ELISA  assay.  The  extent  of  hydrolysis  is 
measured  by  the  loss  of  the  Flag  epitope  at  the  N-terminus  before  the  substrate  insert  and 
expressed  as  a  percentage  of  the  total  relative  to  untreated  controls,  and  is  the  average  of  two 
independent  experiments.  The  hexamer  sequence  displayed  by  individual  phages  was  derived 
from  the  DNA  sequence  of  the  phage  inserts.  Highly  selective  peptides  for  individual  proteases 
tested  were  identified  by  substrate  phage  ELISA.  A  list  of  the  substrate  sequences  is  in  the  table 
I  of  the  paper  title  “Protease  Profiling  by  Phage  Display”  in  the  journal  Analytical  Biochemistry 
included  in  the  fellowship  update. 

Result  hPhage  substrates  identified  for  thrombin  and  factor  Xa  matched  their  physiologic  substrate 
seguences. 

By  aligning  the  substrates  cleaved  more  than  60%  by  thrombin,  two  groups  of  substrates  become 
apparent  (Table  2).  The  consensus  motif  derived  from  a  combination  of  the  groups,  (Gly/Pro)- 
(Are/Lys)-i-Ser-(Trp/Phe),  is  in  agreement  with  the  known  substrate  preference  of  thrombin  and 
matches  the  sequence  surrounding  the  scissile  bond  in  several  physiologic  substrates  for  thrombin, 
including  the  thrombin  receptor  (protease-activated  receptor),  factor  V  and  factor  VII  [  ].  Similarly, 
phage  substrates  identified  for  factor  Xa  contain  a  distinct  recognition  motif  (Gly-Arg-l-X)  that 
matches  well  with  the  prothrombin  activation  site,  the  main  physiologic  target  of  factor  Xa  (21).  A 
comparison  table  is  listed  in  table  II  of  the  paper  title  “Protease  Profiling  by  Phage  Display”  in  the 
journal  Analytical  Biochemistry  included  in  this  annual  summary  update. 

Result  3:  The  substrate  phaze  ELISA  as  a  semi-quantitative  indicator  ofjubstrqte  hydrolysis. 

Experiments  were  conducted  to  determined  how  well  the  substrate  phage  ELISA  predicts  the  rank- 
order  preference  of  analogous  synthetic  peptides.  This  feature  was  explored  because  of  it  potential  to 
be  adapted  to  high  throughput  screening.  Screening  protease  substrates  using  synthetic  peptides  is 
costly  and  time-consuming.  Therefore,  to  speed  up  the  protease  substrate  screening  process,  I 
explored  the  possibility  of  using  substrate  phage  ELISA  as  an  indicator  of  the  relative  hydrolysis  rate 
of  the  substrate  sequence  by  the  targeted  protease. 

The  hydrolysis  of  a  series  of  phage  substrates  for  MMP-9  was  measured  as  a  function  of  time  (figure 
2A)  and  compared  to  the  kcJKm  ratio  for  the  corresponding  synthetic  peptides  (figure  2B).  Results 
from  phage  ELISA  and  actual  kcJKm  of  the  peptide  were  compared.  Both  method  yielded  the  same 
rank  order  of  preference  of  substrate,  A6  >  A7  >  A 16.  Similar  observations  were  made  when  phage 
substrates  for  thrombin  were  compared  to  the  analogous  synthetic  peptides.  Figure  2A  and  2B  are  in 
the  paper  title  “Protease  Profiling  by  Phage  Display”  in  the  journal  Analytical  Biochemistry  included 
in  the  fellowship  update. 


8 


Conclusions  and  future  plans: 

Large  body  of  work  on  studying  protease  expression  in  diseases,  especially  in  cancer  progression,  has 
been  reported  and  continues  to  grow.  Combining  the  information  of  protease  expression  and  one-pot 
phage  selection  system,  one  can  obtain  highly  selective  peptide  substrates  for  disease-related  proteases 
rapidly.  Then,  an  array  of  selective  peptide  substrates  can  be  assembled  and  synthesized  with 
fluorescent  tags  for  example  to  increase  the  detection  sensitivity.  Large  screening  of  protease  activity 
profile  can  then  be  carried  out  to  study  pathological  conditions  in  vivo  such  as  the  breast  cancer  mouse 
model  as  designed  in  the  fellowship  proposal  to  identify  diagnostic  markers  in  breast  cancer 
progression.  Therefore,  my  future  plans  are  the  following. 

1.  Profiling  activity  of  more  breast  cancer  relevant  proteases  such  as  MT1-MMP  to  obtain  highly 
selective  peptide  substrates. 

2.  Explore  methods  to  assemble  the  array  of  selective  peptides  such  as  fluorescent  tagging  for  in  vivo 
screening. 

3 .  Establish  the  orthotopic  breast  cancer  model  as  described  in  the  original  fellowship  proposal  to 
obtain  the  serum  as  well  as  the  tumor  tissue  for  in  vivo  protease  activity  profiling. 
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KEY  RESEARCH  ACCOMPLISHMENTS: 

•  Highly  selective  and  novel  MMP-2  substrate  sequences  were  found  via  substrate  phage  display 
library. 

•  The  selectivity  of  the  novel  MMP-2  peptide  substrates  was  confirmed  by  the  rate  of  hydrolysis  of 
the  synthetic  peptides. 

•  A  method  based  on  the  strategy  of  substrate  phage  display  library  was  developed  to  obtain  the 
substrate  recognition/selectivity  profile  of  multiple  proteases  simultaneously. 

REPORTABLE  OUTCOMES: 

Two  published  papers  and  one  accepted  manuscript  are  listed  as  the  following: 

Kridel,  S.  J.,  Chen,  E.,  Kotra,  L.  P.,  Howard,  E.  W.,  Mobashery,  S.,  and  Smith,  J.  W.  (2001)  Journal  of 
Biological  Chemistry  276(23),  20572-20578 

Kridel,  S.  J.,  Chen  E.,  Smith,  J.W.  (2001)  Analytical  Biochemistry  (Currently  only  online  version  is 
available). 

Chen,  E.,  Kridel,  S.J.,  Smith,  J.W.  (2001)  the  manuscript  titled:  “A  unique  substrate  recognition 
profile  for  Matrix  metalloproteinase-2”  was  accepted  in  May  by  Journal  of  Biological  Chemistry 
(Ml:03823) 

CONCLUSIONS: 

MMP-2  and  9  were  found  to  have  distinct  substrate  recognition  profiles.  Novel  and  highly  selective 
MMP-2  substrate  sequences  were  found  in  this  study.  Data  from  this  study  supports  the  hypothesis 
that  substrate  phage  display  library  can  be  used  to  differentiate  diminutive  structural  difference  of 
proteases.  In  addition,  these  novel  and  highly  selective  MMP-2  substrates  will  be  valuable  in 
developing  in  vivo  activity  assay  for  MMPs  and  mechanistic  design  of  specific  MMP  inhibitors. 

It  is  also  encouraging  to  illustrate  a  new  technology  that  has  a  great  utility  in  revealing  potential 
functions  of  proteases.  To  understand  the  reeling  involvement  of  proteases  in  pathological  conditions, 
one  must  approach  from  the  fundamental  biology  of  the  proteases.  Therefore,  structure-functional 
information  obtained  from  this  study  (detailed  in  the  manuscript  of  “a  unique  substrate  profile 
recognition  of  MMP-2”)  will  help  advancing  our  knowledge  of  the  biology  of  matrix 
metalloproteinases . 

In  additional  to  the  work  of  MMP-2  and  MMP-9  substrate  profile  comparison,  I  developed  a  method 
of  profiling  the  activity  of  multiple  proteases  simultaneously  using  the  substrate  phage  display  library 
that  I  constructed  last  year.  Large  body  of  work  on  studying  protease  expression  in  diseases, 
especially  in  cancer  progression,  has  been  reported  and  continues  to  grow.  Combining  the  information 
of  protease  expression  and  one-pot  phage  selection  system,  one  can  obtain  highly  selective  peptide 
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substrates  for  disease-related  proteases  rapidly.  Then,  an  array  of  selective  peptide  substrates  can  be 
assembled  and  synthesized  with  fluorescent  tags  for  example  to  increase  the  detection  sensitivity. 
Large  screening  of  protease  activity  profile  can  then  be  carried  out  to  study  pathological  conditions  in 
vivo  such  as  the  breast  cancer  mouse  model  as  designed  in  the  fellowship  proposal  to  identify 
diagnostic  markers  in  breast  cancer  progression. 
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APPENDICES: 


Two  original  copies  of  the  paper  I  published  this  year  and  one  manuscript  are  included  to 
supplement  and  clarify  the  annual  summary  update. 
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The  catalytic  clefts  of  all  matrix  metalloproteinases 
(MMPs)  have  a  similar  architecture,  raising  questions 
about  the  redundancy  in  substrate  recognition  across 
the  protein  family.  In  the  present  study,  an  unbiased 
phage  display  strategy  was  applied  to  define  the  sub¬ 
strate  recognition  profile  of  MMP-9.  Three  groups  of 
substrates  were  identified,  each  occupying  a  distinct  set 
of  subsites  within  the  catalytic  pocket.  The  most  preva¬ 
lent  motif  contains  the  sequence  Pro-X-X-Hy-CSerAThr) 
at  P3  through  P2>.  This  sequence  is  similar  to  the  MMP 
cleavage  sites  within  the  collagens  and  is  homologous  to 
substrates  the  have  been  selected  for  other  MMPs.  De¬ 
spite  this  similarity,  most  of  the  substrates  identified 
here  are  selective  for  MMP-9  over  MMP-7  and  MMP-13. 
This  observation  indicates  that  substrate  selectivity  is 
conferred  by  key  subsite  interactions  at  positions  other 
than  P3  and  Pr.  This  study  shows  that  MMP-9  has  a 
unique  preference  for  Arg  at  both  P2  and  Px,  and  a 
preference  for  Ser/Thr  at  P2..  Substrates  containing  the 
consensus  MMP-9  recognition  motif  were  used  to  query 
the  protein  data  bases.  A  surprisingly  limited  list  of 
putative  physiologic  substrates  was  identified.  The 
functional  implications  of  these  proteins  lead  to  testable 
hypotheses  regarding  physiologic  substrates  for  MMP-9. 


Matrix  metalloproteinase-9  (MMP-9)1  is  a  member  of  the 
matrixin  family  of  metallo-endopeptidases  (1-3).  MMP-9  is 
historically  referred  to  as  gelatinase  B  because  of  its  ability  to 
cleave  gelatin,  a  denatured  form  of  collagen,  in  vitro.  Along 
with  MMP-2,  MMP-9  differs  from  other  MMPs  because  it  con¬ 
tains  three  fibronectin  type  II  repeats  that  have  high  binding 
affinity  for  collagen.  These  repeats  are  thought  to  mediate  the 
binding  of  MMP-2  and  -9  to  collagen  (1,  2).  This  binding  inter- 
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action  brings  the  catalytic  pocket  of  the  MMP  in  proximity  to 
collagen,  thereby  enhancing  its  rate  of  hydrolysis.  Despite 
these  well  characterized  biochemical  interactions,  it  is  now 
clear  that  MMP-9  is  also  able  to  cleave  a  number  of  other 
proteins  and  may  have  a  rather  wide  range  of  physiologic 
substrates  (4-8). 

Much  of  our  understanding  of  the  biological  function  of 
MMP-9  comes  from  the  study  of  mice  lacking  this  gene.  For 
example,  MMP-9-deficient  mice  have  impaired  ossification  of 
the  skeletal  growth  plate,  a  defect  that  has  been  partially 
attributed  to  poor  vascularization  of  developing  bone  (9).  Stud¬ 
ies  on  these  mice  also  show  that  MMP-9  is  essential  for  the 
recruitment  of  osteoclasts  into  developing  bones  (10).  Other 
work  indicates  that  MMP-9 -deficient  mice  are  resistant  to  der¬ 
mal  blistering  in  a  bullous  pemphigoid  model,  an  effect  that  has 
been  attributed  to  the  inability  of  these  mice  to  cleave  the 
SERPIN  a  1-proteinase  inhibitor  (5).  Finally,  recent  studies  in 
the  RIPl-Tag2  transgenic  mouse  model  of  multistage  carcino¬ 
genesis  indicate  that  MMP-9  is  part  of  the  angiogenic  “switch” 
that  is  essential  for  tumor  growth  (11, 12).  Other  reports  sug¬ 
gests  that  MMP-9  may  play  a  role  in  inflammation  in  the 
nervous  system.  MMP-9  is  elevated  in  encephelomyelitis  (7, 8), 
in  the  cerebrospinal  fluid  of  patients  with  multiple  sclerosis 
(13),  and  in  patients  with  AIDS-related  dementia  (14). 

A  first  approximation  of  the  substrate  recognition  specificity 
of  MMP-9  has  been  gleaned  from  alignments  of  its  cleavage  site 
within  a  number  of  different  proteins  (4,  8,  9,  15-17).  Never¬ 
theless,  a  detailed  understanding  of  subsite  preferences  for 
MMP-9  is  lacking.  Such  an  analysis  is  particularly  important 
because  the  catalytic  cleft  of  MMP-9  is  closely  related  to  other 
MMPs,  raising  questions  about  the  distinction  among  sub¬ 
strates  for  these  proteases.  These  issues  are  particularly  im¬ 
portant  because  many  of  the  current  pharmacologic  antago¬ 
nists  of  the  MMPs  have  overlapping  inhibition  profiles. 
Furthermore,  in  the  post-genomic  era,  where  the  sequences  of 
all  proteins  will  soon  be  available,  information  on  substrate 
recognition  could  help  identify  important  physiologic 
substrates. 

With  these  ideas  in  mind,  we  applied  an  unbiased  approach 
to  define  the  substrate  recognition  preference  of  MMP-9. 
MMP-9  was  used  to  cleave  substrates  within  a  vastly  complex 
phage  display  library  of  random  hexamers.  Substrates  within 
this  library  can  be  cleaved  at  any  position  within  the  hexamer, 
allowing  information  on  the  substrate  specificity  on  both  sides 
of  the  scissile  bond  to  be  obtained.  Three  families  of  substrates 
were  identified.  The  largest  group  contains  a  Pro-X-X-Hy-(Ser/ 
Thr)  motif  (X  is  any  residue,  and  Hy  is  a  hydrophobic  residue) 
that  occupies  positions  P3  through  P2..  This  general  motif  is 
cleaved  by  a  number  of  MMPs  and  is  presumed  to  represent  a 
collagen-like  substrate  (18-23).  Nevertheless,  substrates 
within  this  family  that  were  selected  show  considerable  selec- 

This  paper  is  available  on  line  at  http://www.jbc.org 
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tivity  for  MMP-9.  The  second  group  of  substrates  are  defined  by 
a  Gly-Leu-(Lys/Arg)  motif  at  positions  Px  through  P2,.  Members 
of  the  third  family  of  substrates  are  unique  in  that  they  contain 
Arg  residues  at  both  Px  and  P2.  Altogether,  these  findings 
reveal  multiple  modes  of  substrate  recognition  by  MMP-9  and 
provide  important  insights  into  the  hydrolysis  of  physiologic 
substrates  that  may  be  important  in  biology  and  pathology. 

EXPERIMENTAL  PROCEDURES 

Purified  forms  of  full-length  MMP-7,  MMP-13,  and  TIMP-2  were 
purchased  from  Chemicon  (Temecula,  CA).  Ilomastat  (GM6001)  was 
purchased  from  AMS  Scientific  (Concord,  CA).  Restriction  enzymes 
were  from  Roche  Biosciences  or  New  England  Biolabs.  Oligonucleotides 
were  synthesized  by  Integrated  DNA  Technologies,  Inc.  Tissue  culture 
media  and  reagents  were  from  Irvine  Scientific  (Irvine,  CA).  All  other 
reagents,  chemicals,  and  plasticware  were  from  Sigma  or  Fisher. 

Construction  of  Substrate  Phage  Display  Library — Substrate  phage 
libraries  were  generated  using  a  modified  version  of  the  fUSE5  phag- 
emid  (24-26).  A  FLAG  epitope  was  engineered  at  the  NH2  terminus  of 
the  genelll  protein  by  annealing  oligonucleotides  5 ' -CCGGGTTT GTC- 
GTC GTCGTCTTTGTAGTCGGTAC-3 '  and  5'  -CGACTACAAAGACGA- 
CGACGACAAAC-3'  and  ligating  them  into  fUSE5  at  the  Kpnl  andXial 
restriction  sites.  The  random  hexamers  were  generated  by  PCR  ext¬ 
ension  of  the  template  oligonucleotide  5 ' - GGGGAGGC C GAC GT GGC C- 
GTCATCAGGCGGCTCAGGC(NNK)eACGG  CCTCTGGGGCCGAAAC- 
3',  where  N  is  any  nucleotide  and  K  is  either  G  or  T.  The  template 
oligonucleotide  also  encodes  an  SGGSG  linker  positioned  in  between 
the  FLAG  epitope  and  the  random  hexamer.  A  primer  oligonucleotide 
5'  -AATTTCTAGTTTCGGCCCCAGAGGC-3 '  and  the  template  oligonu¬ 
cleotide  were  mixed  and  heated  at  65  °C  for  2  min.  The  heating  block 
was  switched  off  and  allowed  to  cool  passively  to  40  °C  to  allow  annea¬ 
ling  of  the  extension  oligonucleotide  to  the  template  oligonucleotide. 
Elongation  of  the  template  oligonucleotide  was  performed  using  Sequ- 
enase  (United  States  Biochemical  Corp.)  (27).  The  final  cDNA  product 
was  precipitated  with  ethanol,  re-suspended  in  water,  and  digested 
with  Sfil.  The  DNA  insert  and  fUSE5  were  mixed  and  ligated  at  a  5:1 
molar  ratio  and  electroporated  into  Escherichia  coli  MC1061(F-). 
Several  phage  were  selected  for  sequencing  to  confirm  randomness  in 
the  insert  sequences  and  the  correct  reading  frame. 

Expression  and  Purification  of  MMP-9 — The  cDNA  encoding  the 
catalytic  domain  of  MMP-9  (28,  29)  was  generated  by  PCR  and  cloned 
into  pCDNA3  (Invitrogen).  The  expression  vector  was  used  to  transfect 
HEK  293  cells  by  electroporation,  and  individual  antibiotic-resistant 
colonies  were  isolated  with  cloning  rings,  expanded,  and  then  screened 
by  reverse  transcription-PCR  for  MMP-9  mRNA  Expression  of  the 
proteinaceous  catalytic  domain  was  determined  by  zymography  using 
conditioned  medium  from  the  transfected  cells.  Cells  expressing  the 
MMP-9  catalytic  domain  were  grown  in  150-mm  dishes  and  conditioned 
in  Dulbecco’s  modified  Eagle’s  medium  with  10%  fetal  bovine  serum 
supplemented  with  G418  (200  pg/mi)  for  5  days.  The  catalytic  domain  of 
MMP-9  was  purified  from  the  conditioned  medium  by  gelatin-Sepha- 
rose  chromatography  as  described  previously  (28,  29).  Fractions  con¬ 
taining  MMP-9  were  pooled  and  dialyzed  against  50  mM  Tris,  pH  7.8, 10 
mM  NaCl.  The  dialyzed  material  was  then  purified  further  using  a  1-ml 
Hi-Trap  Source  Q  ion  exchange  column  (Amersham  Pharmacia  Bio¬ 
tech).  Fractions  containing  MMP-9  were  pooled  and  dialyzed  against  50 
mM  Tris,  pH  7.5, 100  mM  NaCl,  10  mM  CaCl2  and  then  concentrated  in 
a  dialysis  bag  using  Aquacide  II  (Calbiochem).  The  final  concentration 
of  purified  material  was  typically  1.5-3  mg/ml. 

Activation  and  Active  Site  Titration  of  Proteases — The  purified  cata¬ 
lytic  domain  of  MMP-9  was  activated  by  incubation  with  2  mM  APMA 
for  16  h  at  room  temperature  (29).  Activation  was  monitored  by  altered 
migration  of  the  protein  on  SDS-polyacrylamide  gel  electrophoresis  and 
zymography.  Pro-MMP-13  was  activated  with  APMA  for  3  h  at  room 
temperature,  and  MMP-7  required  no  activation. 

The  level  of  active  protease  was  always  quantified  by  active  site 
titration  prior  to  kinetic  studies.  The  active  site  of  MMP-9  was  titrated 
using  the  hydroxamate  inhibitor  Ilomastat  (30).  Briefly,  25  nM  MMP-9 
was  incubated  with  a  range  of  Ilomastat.  Inhibition  of  MMP-9  by 
Domastat  was  then  measured  by  determining  the  rate  of  cleavage  of 
three  concentrations  of  the  peptide,  mlAll,  using  the  fluorescamine 
incorporation  assay  (21,  31).  The  active  sites  of  MMP-13  and  MMP-7 
were  titrated  with  human  TIMP-2.  Briefly,  5-15  nM  of  each  protease 
were  pre-incubated  with  a  range  of  TIMP-2  for  5  h  at  room  temperature. 
Residual  MMP-13  activity  was  measured  by  cleavage  of  MCA-Pro-Cha- 
Gly-Nva-His-Ala-Dpa-NH2  (Calbiochem)  as  substrate,  and  residual 
MMP-7  activity  was  measured  by  cleavage  of  MCA-Pro-Leu-Gly-Leu- 


Dpa-Ala-Arg-NH2  (Calbiochem)  with  monitoring  at  Aax  320  nm  and  AQm 
405  nm. 

Phage  Selection  of  MMP-9  Substrates— The  substrate  phage  library 
(2  X  1010  phage)  was  incubated  with  2.5  pg/ml  MMP-9  in  50  mM  Tns, 
pH  7.4,  100  mM  NaCl,  10  mM  CaCl2,  0.05%  Brtf-35,  and  0.05%  BSA  for 
1  h  at  37  °C.  A  control  selection  was  performed  without  protease.  The 
cleaved  phage  were  separated  from  the  non-cleaved  phage  by  immu- 
nodepletion.  100  pg  of  anti-FLAG  monoclonal  antibody  (Sigma)  was 
added  to  the  phage  samples  and  then  incubated  for  18  h  with  rocking  at 
4  °C.  The  phage-antibody  complexes  were  precipitated  by  the  addition 
of  100  pi  of  Pansorbin  (Calbiochem).  The  cleaved  phage  remaining  in 
the  supernatant  were  amplified  using  K91  E.  coli  as  described  previ¬ 
ously  (32-34)  and  were  then  used  for  subsequent  rounds  of  substrate 
selection. 

Substrate  Phage  ELISA — Hydrolysis  of  individual  phage  substrates 
was  measured  using  a  modified  ELISA.  96-well  microtiter  plates  were 
coated  with  anti-M13  antibody  (Amersham  Pharmacia  Biotech,  2.5 
fig/ml),  in  phosphate-buffered  saline,  overnight  at  4  °C.  After  coating, 
the  wells  were  blocked  for  1  h  at  room  temperature  in  TBS-T  (50  mM 
Tris,  pH  7.8,  150  mM  NaCl,  0.2%  Tween  20)  containing  10  mg/ml  BSA. 
After  blocking,  150  p\  of  supernatant  from  an  overnight  phage  culture 
was  added  to  each  well  and  incubated  for  2  h  at  4  °C  to  allow  for  phage 
capture.  Unbound  phage  were  removed  with  three  washes  of  ice-cold 
TBS-T.  To  asses  hydrolysis,  MMP-9  (2.5  pg/ml)  was  added  to  the  ap¬ 
propriate  wells  in  Incubation  Buffer  (50  mM  Tns,  pH  7.5, 100  mM  NaCl, 
10  mM  CaCl2,  0.05%  BSA,  0.05%  Brij-35,  50  pM  ZnCl2)  for  2  h  at  37  °C. 
Control  wells  lacked  protease.  The  protease  solution  was  removed,  and 
the  wells  were  washed  four  times  with  ice-cold  TBS-T.  To  measure 
hydrolysis  of  the  peptides  on  phage  by  MMP-9,  anti-FLAG  polyclonal 
antibody  (1.8  pg/mi  in  TBS-T  with  1  mg/ml  BSA)  was  added  to  each  well 
and  the  plates  were  incubated  at  4  °C  for  1  h.  Binding  of  anti-FLAG 
antibody  to  FLAG  epitope  was  measured  with  a  horseradish  peroxi¬ 
dase-conjugated  goat  anti-rabbit  IgG  antibody  (Bio-Rad)  followed  by 
detection  at  490  nm.  The  extent  of  hydrolysis,  taken  as  a  measure  of 
substrate  hydrolysis,  was  calculated  by  the  ratio  of  the  optical  density 
at  490  nm  of  the  protease-treated  samples  versus  samples  lacking 
protease. 

Determination  of  Scissile  Bonds — The  cleavage  site  for  MMP-9 
within  peptide  substrates  was  determined  using  MALDI-TOF  mass 
spectrometry.  MMP-9  (25  nM)  was  incubated  with  100  pM  amounts  of 
each  peptide,  independently,  in  50  mM  Tris,  pH  7.5,  100  mM  NaCl,  10 
mM  CaCl2  for  2  h  at  37  °C.  The  mass  of  the  cleavage  products  was 
determined  using  a  Voyager  DE-RP  MALDI-TOF  mass  spectrometer 
(PerSeptive  Biosystems,  Framingham,  MA).  Following  hydrolysis,  the 
peptide  samples  were  prepared  according  to  methods  described  previ¬ 
ously  (35-38).  In  all  cases,  no  other  peaks  that  corresponded  to  other 
potential  cleavage  products  were  observed. 

Kinetic  Measurements  of  Peptide  Hydrolysis — The  kinetic  parame¬ 
ters  of  substrate  hydrolysis  were  measured  using  a  fluorescamine  in¬ 
corporation  assay  that  has  been  described  previously  (21,  31,  34,  39). 
Briefly,  MMP-9,  MMP-7,  or  MMP-13  were  incubated  with  individual 
peptide  substrates  at  concentrations  ranging  from  100  to  800  pM  in  50 
mM  Tris,  pH  7.5, 100  mM  NaCl,  10  mM  CaCIa,  50  pM  ZnCl2.  At  selected 
time  points,  the  reactions  were  stopped  by  the  addition  of  1,10-phenan- 
throline.  Peptide  hydrolysis  was  determined  by  the  addition  of  fluores¬ 
camine  followed  by  detection  at  Aex  355  nm  and  Aem  460Tim.  The  data 
were  transformed  to  double-reciprocal  plots  (1/[S1  versus  1/u*)  to  deter¬ 
mine  Km  and  kCAt  (21,  31,  34,  39).  Similar  results  were  obtained  using 
different  batches  of  protease.  For  some  substrates,  Km  and  k nt  could  not 
be  determined  individually,  but  the  specificity  constant,  kc at/  Kmi  was 
derived  by  the  equation:  k^JKm  =  v/(E0)(S0)  (22). 

Computational  Molecular  Modeling — The  terminology  established 
by  Schechter  and  Berger  is  used  to  describe  the  subsites  in  the  protease 
active  site  and  the  correlating  positions  in  substrates  (40).  This  method 
describes  the  former  as  a  set  of  subsites  designated  as  S  (Slf  S2,  etc.)  and 
the  latter  as  P  Q?lf  P2,  etc.).  The  primed  sites  are  to  the  carboxyl- 
terminal  and  unprimed  sites  are  to  the  amino-terminal  side  of  the 
scissile  bond.  The  x-ray  structure  of  MMP-9  has  not  been  reported  to 
date.  We  have  previously  reported  a  computational  three-dimensional 
model  for  the  catalytic  domain  of  MMP-9,  which  was  generated  by 
homology  modeling  taking  advantage  of  sequence  alignment  and  con¬ 
servation  of  secondary  structural  elements  (41-43).  This  model  shows 
high  sequence  and  three-dimensional  homology  to  the  structure  of 
MMP-2  (42),  and  it  indicates  an  extended  cleft  for  binding  of  substrate 
that  can  accommodate  at  least  six  amino  acids  from  the  substrate.  We 
considered  molecular  models  for  representative  members  of  each  class 
of  substrates  (peptide  A10  and  peptide  Cll),  but  we  concluded  that 
there  is  not  sufficient  sequence  information  for  the  group  H  substrates 
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to  draw  meaningful  conclusions.  Therefore,  the  modeling  work  was 
concentrated  on  groups  I  and  HI.  The  molecular  models  of  the  se¬ 
quences  flanking  the  sites  of  MMP-9  cleavage  in  clones  A10  and  Cll 
(Ac-SGPLFYSVTA-NH2  and  Ac-SGRRLIHHTA-NH2,  respectively) 
were  constructed  using  the  SYBYL  molecular  modeling  program  and 
were  docked  into  the  active  site  of  MMP-9  (42).  Each  enzyme-substrate 
complex  contained  the  hydrolytic  water  molecule  positioned  in  the 
active  site  near  the  Glu-402  side  chain,  the  general  base  that  promotes 
it  for  the  hydrolytic  reaction.  The  enzyme-substrate  complexes  were 
energy-minimized  using  the  AMBER  5.0  software  package  (44, 45).  The 
protocol  for  the  energy  minimization  was  described  previously  (42). 

RESULTS 

Construction  of  the  Phage  Library— Vie  created  a  system  for 
displaying  random  hexamer  substrates  on  the  surface  of  phage. 
The  fUSE5  polyvalent  phage  display  vector  (24-26)  was  mod¬ 
ified  to  express  random  hexamers  at  the  amino  terminus  of  the 
genelll  protein,  and  an  octapeptide  FLAG  epitope  at  the  ami¬ 
no-terminal  end  of  the  random  hexamers.  The  library  com¬ 
prises  2.4  X  10s  independent  transformants,  giving  a  75% 
confidence  that  each  of  the  6.4  X  107  possible  random  hexamer 
sequences  are  represented  in  the  library.  Sequencing  of  phage 
confirmed  the  randomness  of  the  hexamer  insert.  Under  the 
selection  conditions,  greater  than  95%  of  phage  could  be  immu- 
nodepleted  using  an  anti-FLAG  antibody  (data  not  shown). 

Selection  of  Peptide  Substrates  for  MMP-9 — Optimal  sub¬ 
strates  were  selected  by  exposing  the  phage  library  to  a  recom¬ 
binant  form  of  the  catalytic  domain  of  MMP-9  expressed  in 
HEK  293  cells  (29).  The  recombinant  catalytic  domain  of 
MMP-9  was  purified  on  gelatin-Sepharose,  followed  by  ion  ex¬ 
change  chromatography  (29).  The  protease  was  activated  with 
2  mM  APMA  for  16  h  at  room  temperature,  and  the  active  site 
was  titrated  with  the  hydroxamate  inhibitor  Ilomastat  (29,  30). 
Phage  selections  were  performed  with  2.5  pgf ml  (56  nM)  active 
MMP-9.  Following  three  rounds  of  exposure  to  MMP-9,  indi¬ 
vidual  phage  clones  were  selected  for  sequencing.  An  alignment 
of  the  motifs  revealed  three  groups  of  structurally  distinct 
substrates  (Table  I).  Substrates  in  group  I  contain  the  motif 
with  sequence  Pro-X-X-Hy-(Ser/Thr).  Further  analysis  shows 
that,  within  this  larger  motif,  Arg  is  favored  at  P2  and  Ser/Thr 
is  favored  at  Px  (consensus  motif  noted  in  Table  I).  Substrates 
in  group  II  contain  a  Gly-Leu-(Lys/Arg)  motif.  Interestingly, 
both  groups  I  and  II  are  related  to  substrates  that  have  been 
described  previously  for  other  MMPs  and  are  somewhat  related 
to  sites  of  cleavage  in  collagen  (21-23).  Group  III  substrates, 
however,  appear  to  represent  a  novel  recognition  sequence  as  it 
contains  the  Arg-Arg-Hy-Leu  (group  IIIA)  and  Arg-X-Leu 
(group  IIIB)  motifs.  The  two  subclasses  that  comprise  group  III 
have  not  been  described  previously  as  a  substrate  motif  for 
MMP-9. 

The  ability  of  MMP-9  to  hydrolyze  each  of  the  phage  clones 
was  assessed  in  a  semiquantitative  manner  using  a  modified 
ELISA.  Individual  phage  clones  were  captured  into  microtiter 
plates  using  anti-M13  antibody.  Captured  clones  were  exposed 
to  MMP-9  (2.5  pg/ml).  A  polyclonal  anti-FLAG  antibody  was 
used  to  measure  the  liberation  of  the  FLAG  epitope  by  hydrol¬ 
ysis  of  the  substrate  with  MMP-9.  Results  are  expressed  as  the 
extent  of  hydrolysis  relative  to  untreated  phage  clones  (Table 
I).  In  general,  the  group  I  substrates  were  hydrolyzed  most 
efficiently.  However,  a  number  of  the  substrates  within  groups 
II  and  III  were  cleaved  to  the  same  degree  as  group  I 
substrates. 

Identification  of  Scissile  Bonds  within  MMPS  Sub¬ 
strates — To  identify  the  position  of  the  scissile  bonds,  10  pep¬ 
tides,  representing  each  of  the  three  groups  of  substrates,  were 
designed  and  synthesized.  Following  exposure  to  MMP-9,  the 
scissile  bond  was  determined  by  analyzing  the  cleavage  prod¬ 
ucts  by  MALDI-TOF  analysis  (Table  II).  This  analysis  revealed 


Table  I 

Sequence  alignment  and  grouping  of  phage  selected  by  MMPS 
Phage  from  the  third  round  MMP-9  selection  were  selected  for  their 
ability  to  be  cleaved  by  MMP-9  and  sequenced.  The  substrates  are 
separated  into  three  groups  based  on  distinguishable  motifs.  Residues 
that  define  each  group  are  in  bold.  The  ability  of  MMP-9  to  cleave  the 
substrates  on  phage  was  measured  in  a  modified  ELISA  format.  A  rank 
order  of  preference  within  each  group  is  demonstrated  as  a  function  of 
phage  cleaved  by  MMP-9  relative  to  a  non-treated  control.  This  is 
expressed  as  percentage  of  hydrolysis.  The  results  of  two  independent 
experiments  are  shown.  Hy,  hydrophobic  amino  acids. 

%  Hydrolysis 

Group  Peptide  sequence  Clone  ”  ” 

I  P  L  F  Y  S  V  A10  100  100 

K  I  P  R  T  L  T  All  100  100 

P  L  R  L  S  W  A12  100  100 

P  R  A  V  S  T  A6  100  99 

K  G  P  R  Q  I  T  C15  100  98 

P  R  P  It  S  G  012  82  94 

W  P  L  G  L  A  D35  89  91 

F  R  P  R  S  I  T  D36  87  91 

R  L  P  V  G  L  T  D20  89  90 

R  S  P  K  S  L  T  D21  88  90 

P  V  W  L  A  A  D22  85  90 

IHPSSLTA  D4  87  89 

P  A  S  F  T  S  D29  87  86 

G  Q  P  H  Y  L  T  B1  79.6  85 

M  K  P  A  S  W  T  B2  77.5  77 

T  H  P  Y  T  M  T  B10  74.1  82 

T  P  A  Y  M  L  T  A13  78.9  72 

PLY.LT  D2  79  82 

P  G  L  J  G  T  D1  75  85 

P  R  S  I  S  N  D24  82  82 

R  L  P  A  S  Y  T  D30  82  60 

N  P  P  R  Y  L  T  D5  70  67 

P  K  T  Q  I  S  D8  73  65 

S  V  P  R  H  F  T  D17  70  52 

L  L  P  A  W  L  T  D3  56  48 

T  H  P  Y  T  M  T  D14  50  47 

L  R  P  A  K  S  T  A18  28.9  30 

SGPSTST  A9  13.1  5.8 

Consensus  P  R  S/T  Hy  S/T 

II  GSGLKA  A7  79.8  76 

A  M  G  L  K  S  B9  73.3  86 

K  V  G  L  R  T  B8  13.3  12 

mA  G  R  R  L  X  H  H  Cll  94.9  74 

HPRRSIT  Cl  13.9  29 

G  R  R  L  L  S  R  A16  20.2  11 

A  L  R  R  L  E  T  C13  8.3  24 

FYKRVLT  B6  11.7  11.1 

F  R  R  I  C  V  C7  17.9  0 

V  F  F  R  R  Q  T  A  C8  8.9  0 

TTTR  GLARNITA  B3  66  70 

FGSRYLTA  A19  45  50 

Q  D  R  Y  L  N  T  C6  -  40  27 


that  each  substrate  contains  a  hydrophobic  residue  at  the  Pr 
position.  The  substrates  in  group  I  contain  Pro  at  the  P3  posi¬ 
tion.  The  single  peptide  from  group  II  contained  a  Gly  at  Px, 
Leu  at  Pr>  and  a  Lys  at  P2„  The  motifs  of  groups  I  and  II,  and 
locations  of  their  scissile  bonds,  are  similar  to  MMP  cleavage 
sites  in  collagen  and  gelatin  (21—23).  Interestingly,  however, 
the  peptides  from  group  III  were  all  cleaved  after  an  Arg 
residue,  and  in  several  cases  an  Arg-Arg  occupied  the  P2  and  P 1 
positions. 

Kinetic  Characterization  of  Substrate  Hydrolysis  by  MMP- 
9 — The  Michaelis  constant  (Km)  and  first-order  rate  constant  of 
substrate  peptide  turnover  (kcat)  were  measured  by  incubating 
a  range  of  each  peptide  with  MMP-9.  Peptide  hydrolysis  was 
measured  by  incorporation  of  fluorescamine  onto  newly  formed 
amino  termini  as  previously  described  (31,  33,  34).  From  these 
measurements,  and  Km  were  derived  for  each  peptide  using 
-  double-reciprocal  plots  of  1/[S]  versus  Uv^  Results  from  this 
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Table  II 

Kinetic  analysis  ofMMP-9  peptide  substrates  based  on  phage  clones 

Each  peptide  was  incubated  with  protease  and  hydrolysis  was  detected  as  described  under  “Experimental  Procedures.”  The  peptides  are  named 
according  to  their  phage  clone  designation.  All  peptides  were  synthesized  with  acetylated  NR*  termini  and  amidated  COOH-termmi.  Allml  and 
Allm2  are  analogues  of  peptide  All.  Changes  are  indicated  in  bold.  Scissile  bonds  are  designated  with  1 .  Amino  acids  to  the  nght  of  the  i  are 
Drimed  residues  (. .  .Pr,  P,.,  etc.),  and  amino  acids  to  the  left  are  non-primed  residues  (. .  .P2,  P^  (40).  **,  measurements  were  not  made;  ,  not 


Peptide 


Sequence 


MMP-9 

MMP- 

13 

MMP-7 

Ac JKm 

Ac*t 

Km 

ncm 

AoA 

m'1  s'1 

s'' 

mu 

i r1  s'1 

188,000 

703 

3.7 

15,400 

13,800 

67,500 

86.4 

1.3 

23,500 

7,000 

61,000 

178 

2.9 

12,400 

1,300 

160,000 

224 

1.4 

15,200 

24,000 

2000 

9.4 

4.7 

** 

** 

25,600 

20 

0.5 

»* 

** 

8800 

9 

1 

** 

** 

5800 

78 

13.5 

** 

** 

12,400 

30 

2.4 

12,300 

41,000 

13,200 

11 

0.8 

* 

1,000 

1200 

** 

** 

** 

** 

♦ 

** 

** 

** 

** 

C15 

All 

A6 

Allml 

Allm2 

A10 

B1 

A18 

A7 

A19 

A16 

Cll 


SGKGPRQ 1  IT  A 
SGKIPRTl  LTA 
SGPRA 1  VSTTA 
SGKIPRR I  LTA 
SGKIPRT  l  ATA 
SGPLF 1  YSVTA 
SGQPHY  i  LTTA 

sglrpak  I  STA 

SG  i  LKALMITA 
SGFGSRYl  LTA 
SGRR  i  LLSRTA 
SGRR  l  LIHHTA 


analysis  are  shown  in  Table  II.  Among  the  peptides,  the  kcat 
values  ranged  from  9  s_1  to  703  s"1,  and  Km  values  were 
generally  in  the  high  micromolar  to  low  millimolar  range.  Over¬ 
all,  peptides  from  group  I  appeared  to  represent  the  best  sub¬ 
strates,  with  peptide  CIS  having  a  kCELJKm  ratio  of  180,000  M_1 
s-1,  the  highest  of  any  of  the  substrates.  Peptide  All,  which 
closely  matches  the  consensus  recognition  motif  of  group  I,  also 
exhibits  a  high  kcJ  Km  ratio  (67,500  M"1  s_1)  and  considerable 
selectivity  for  MMP-9. 

Interestingly,  the  three  peptides  cleaved  most  efficiently  by 
MMP-9  were  from  group  I  and  contained  Arg  at  the  P 2  position. 
This  led  us  to  hypothesize  that  group  I  and  III  substrates  might 
represent  a  larger  set  of  substrates  whose  relationship  to  one 
another  is  not  entirely  evident  from  the  sequences  of  only  100 
clones.  To  test  this  idea,  we  synthesized  a  mutant  peptide 
(Allml)  that  contains  a  Thr  -»  Arg  substitution  at  Px  within 
the  context  of  the  sequence  of  the  All  peptide.  Hence,  the 
mutant  peptide  contained  elements  of  both  group  I  (Pro  at  P3 
and  Leu  at  Pr)  and  group  III  (Arg  at  P2  and  Px).  Peptide  Allml 
had  a  JK^  twice  that  of  the  parent  peptide,  an  effect  result¬ 
ing  primarily  from  an  increase  in  kcat.  This  finding  suggests  that 
the  substitution  of  Arg  at  V1  lowers  the  transition  state  energy  of 
the  protease-substrate  interaction.  This  observation  also  indi¬ 
cates  that,  within  the  Pro-X-X-Hy-(Ser/Thr)  motif,  Arg  residues 
at  P2  and  Px  are  favored  for  MMP-9,  and  that  the  substrates  from 
groups  I  and  III  may  have  a  related  binding  mechanism. 

In  almost  every  case,  the  MMP-9  substrates  selected  from 
the  phage  library  contain  a  hydrophobic  residue  at  Pi»,  a  find¬ 
ing  that  is  entirely  consistent  with  the  fact  that  MMPs  are 
known  to  have  a  deep  hydrophobic  Sj.  pocket  (1,  2,  46).  To 
assess  the  contribution  of  this  hydrophobic  residues  to  sub¬ 
strate  binding  and  to  substrate  turnover,  we  synthesized  an¬ 
other  mutant  peptide  (Allm2)  based  on  the  sequence  of  peptide 
All,  but  containing  Ala,  rather  than  Leu,  at  Py  (Ac- 
SGKIPRT  l  ATA-NH2).  This  substitution  had  deleterious  ef¬ 
fects  on  both  k^t  and  Km,  and  reduced  the  kcaJKm  ratio  nearly 
30-fold.  These  results  are  consistent  with  the  idea  that  the  Si- 
subsite  ofMMP-9  coordinates  substrate  binding  and  also  influ¬ 
ences  the  rate  of  hydrolysis.  Importantly,  however,  even  this 
mutant  peptide  had  a  measurable  kcaJKm  ratio  (2000  M  1  s  1), 
indicating  that  efficient  hydrolysis  can  be  enacted  by  MMP-9  if 
the  rest  of  the  substrate  sequence  is  optimal. 

Kinetic  Characterization  of  Substrate  Hydrolysis  by  MMP-7 
and  MMP-13— Since  many  of  the  selected  substrates  contained 
a  motif  similar  to  that  described  for  other  MMPs  (P-X-X-Hy), 


we  measured  the  degree  to  which  MMP-7  and  MMP-13  could 
cleave  these  MMP-9  substrates.  These  two  MMPs  were  used  for 
comparison  because  substrates  for  both  proteases  have  been 
selected  using  a  similar  phage  display  approach.  Interestingly, 
most  of  the  substrates  we  tested  were  cleaved  more  efficiently 
by  MMP-9.  The  ratios  ranged  from  2.6-  to  47-fold 

higher  for  MMP-9  than  for  either  MMP-7  or  MMP-13  (Table  II). 
Only  peptide  A7  deviated  from  this  trend.  It  is  also  worth 
noting  that,  although  the  substrates  in  group  IIIA  (Arg-Arg) 
were  not  cleaved  rapidly  by  MMP-9,  we  were  unable  to  detect 
any  hydrolysis  of  these  peptides  by  either  MMP-7  or  -13. 

Modeling  Substrate  Interactions  with  MMPS— Molecular 
modeling  studies  were  conducted  to  help  visualize  how  sub¬ 
strates  might  dock  into  the  enzymatic  cleft  of  MMP-9.  Energy- 
minimized  models  of  MMP-9  were  constructed  according  to 
procedures  described  under  “Experimental  Procedures,  and 
then  docked  with  peptides  A10  (Ac-SGPLFYSVTA-NH2)  repre¬ 
senting  group  I  and  Cll  (Ac-SGRRLIHHTA-NH2)  representing 
group  III. 

A  primary  feature  of  the  group  I  substrates  is  a  Pro  residue 
at  the  P3  position.  The  unique  conformational  features  of  Pro 
introduce  the  appropriate  “bend”  needed  for  the  optimal  sub¬ 
strate  binding  at  this  site.  This  Pro  residue,  occupying  the  S3 
subsite  of  MMP-9,  is  illustrated  in  Fig.  1A  (white  arrow).  The 
group  I  substrates  also  contain  an  invariant  hydrophobic  resi¬ 
due  at  the  Pr  position.  This  residue  protrudes  into-the  deep  Sr 
pocket  of  the  protease,  as  depicted  by  the  orange  arrows. 

Amino  acids  with  long  basic  side  chains  at  P2  and  P lt  such  as 
Arg,  are  the  defining  features  of  the  substrates  in  group  HI. 
Although  the  presence  of  these  residues  at  P2  and  Px  is  some¬ 
what  surprising,  the  energy-minimized  models  support  the  ob¬ 
servation  that  these  residues  bind  favorably.  An  Arg  at  P2  is 
likely  to  interact  with  the  backbone  carbonyl  moieties  of  His- 
405,  Gly-408,  and  the  side  chain  of  Asp-410,  all  of  which  con¬ 
tribute  to  the  S2  subsite  within  MMP-9  (white  arrow  in  Fig. 
LB).  These  electrostatic  interactions  are  predicted  to  contribute 
to  favorable  binding  of  this  class  of  substrates.  Many  of  the 
group  III  substrates  also  contain  an  Arg  residue  at  Pj.  The 
favorable  interaction  of  Arg  into  the  Sx  subsite  can  be  explained 
by  the  somewhat  unusual  nature  of  this  subsite.  It  is  essen¬ 
tially  a  hydrophobic  binding  surface  that  would  be  predicted  to 
accommodate  the  hydrophobic  side  chains  of  amino  acids  such 
as  Ala,  Phe,  and  Tyr.  However,  the  docking  studies  of  peptides 
with  Arg  at  P2  show  that  the  hydrophobic  surface  of  the  S2 
subsite  could  also  bind  to  the  extended  methylene  group  in  the 
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Fig.  1.  Three-dimensional  models  of  distinct  binding  modes  between  substrate  and  MMP-9.  To  ascertain  whether  peptides  from 
different  substrate  families  exhibit  different  modes  of  binding  in  the  active  site  of  MMP-9,  models  with  two  representative  peptides  binding  to 
MMP-9  were  constructed.  The  stereo  view  represents  the  energy-minimized  complexes  of  peptide  A10  (SGPLFYSVTA,  panel  A)  and  peptide  Cll 
(SGRRLLSRTA,  panel  B )  in  the  active  site  of  the  MMP-9  catalytic  domain.  The  active  site  of  MMP-9  is  shown  as  a  green  Conolly  surface,  and  the 
peptide  atoms  are  colored  according  to  type  (carbon,  white ;  oxygen,  red;  nitrogen,  blue).  The  catalytic  zinc  ion  is  represented  as  a yellow  sphere ,  and 
the  backbone  of  MMP-9  outside  the  active  site  is  shown  in  magenta.  The  orange  arrows  in  both  panels  show  the  deep  hydrophobic  pocket  at  the 
Sr  subsite.  In  panel  A,  the  white  arrow  shows  the  Pro  of  substrate  A10  penetrating  well  into  the  S3  pocket.  In  contrast,  a  Gly  from  substrate  Cll 
occupies  this  subsite  (panel  B). 


side  chain  of  Arg.  In  addition,  the  hydrophobic  channel  of  the 
S2  subsite  contains  the  backbone  carbonyl  of  Pro-180,  which  is 
likely  to  engage  in  electrostatic  interaction  with  the  basic  side 
chain  of  Arg,  stabilizing  the  interaction. 

DISCUSSION 

Because  of  their  association  with  a  number  of  diseases,  the 
MMPs  have  received  considerable  attention  as  drug  targets 
(15-17).  Much  of  the  effort  in  this  area  has  focused  on  the 
design  of  small  molecule  antagonists  with  two  primary  fea¬ 
tures;  1)  a  hydroxamate  moiety  that  binds  to  the  proteases 
catalytic  zinc,  and  2)  a  rather  large  hydrophobic  moiety  that 
fits  into  the  deep  Sr  pocket  present  in  all  MMPs  (30).  This 
synthetic  strategy  has  focused  structure- activity  studies  to  es¬ 
sentially  two  positions  within  the  catalytic  pocket.  An  under¬ 
standing  of  the  interactions  between  the  substrate  and  other 
key  subsites  within  the  catalytic  pocket  is  lacking. 

We  have  identified  three  families  of  peptide  substrates  for 
MMP-9  that  each  appear  to  interact  differently  with  the  cata¬ 
lytic  cleft  of  the  protease.  Substrates  in  group  I  are  cleaved 
most  efficiently  by  MMP-9.  These  peptides  all  contain  a  Pro  at 
P3  and  a  large  hydrophobic  residue  at  Pr.  In  this  respect,  the 
group  I  substrates  are  similar  to  collagen-like  sequences  that 
are  known  to  be  cleaved  by  the  MMPs  (19,  20).  A  prior  analysis 
of  a  small  series  of  synthetic  peptides  based  on  collagen,  and 
containing  a  similar  Pro-X-X-Hy  core,  showed  kCAJKm  ratios 
ranging  from  340  to  1000  mM'1  h_1.  These  values  are  -100 
fold  lower  than  those  exhibited  by  the  best  peptides  reported 
here.  Consequently,  the  added  diversity  afforded  by  phage  li¬ 
braries  allowed  us  to  identify  better  substrates  and  a  wider 
range  of  substrates.  Interestingly,  many  different  MMPs  rec¬ 
ognize  the  Pro-X-X-Hy  core  sequence.  Can  subsite  interactions 
at  positions  other  than  P3  and  Pr  generate  selectivity  within 
this  family  of  sequences? 


Apparently,  individual  MMPs  do  exhibit  a  great  deal  of  se¬ 
lectivity  for  peptides  containing  the  P-X-X-Hy  core  sequence. 
For  example,  most  of  the  MMP-9  substrates  within  group  I  are 
selective  for  MMP-9  over  MMP-7  and  MMP-13.  Some  of  the 
Acat/K m  ratios  for  these  peptides  are  up  to  47-fold  higher  for 
MMP-9  than  for  the  other  MMPs  tested.  These  findings  suggest 
that  substrate  selectivity  can  be  conferred  by  subsite  interac¬ 
tions  outside  of  the  dominant  P3  and  Pr  subsites  that  are 
common  among  MMP  substrates. 

Since  phage  substrate  selections  have  been  performed  for 
MMP-3,  -7,  -9,  and  -13,  an  analysis  of  the  frequency  by  which 
individual  residues  occupy  distinct  subsites  can  be  used  as  a 
first  test  of  this  idea.  This  comparison  reveals  considerable 
distinction  in  the  residues  that  occupy  the  P2,  P*,  and  P2» 
subsites.  Nearly  one-third  of  all  group  I  substrates  for  MMP-9 
contain  Arg  at  P2.  Although  Arg  can  also  be  found  at  P2  in 
peptide  substrates  for  MMP-13,  its  frequency  is  much  lower 
than  in  the  MMP-9  substrates  we  selected  (47).  Furthermore, 
Arg  is  rarely,  if  ever,  found  at  P2  in  MMP-3  or  -7  peptide 
substrates  (32).  Ser  or  Thr  most  frequently  occupies  the  Px 
position  of  the  MMP-9  substrates.  However,  a  Gly  residue  is 
preferred  at  this  position  by  MMP-13,  and  Asp  or  Glu  are 
preferred  by  MMP-3  and  -7  (32,  47).  Significant  differences  are 
also  observed  at  P2>.  In  the  group  I  MMP-9  substrates,  23  of  28 
substrates  have  Ser  or  Thr  at  P2',  but  neither  residue  is  prev¬ 
alent  at  this  subsite  in  the  substrates  selected  for  the  other 
MMPs  (32,  47).  These  observations  support  the  contention  that 
subsite  interactions  other  than  P3  and  Pr  have  a  significant 
impact  on  substrate  selectivity  among  the  MMP  family. 

Here  we  observed  two  additional  families  of  substrates  for 
MMP-9  that  are  distinct  from  the  P-X-X-Hy  family.  Group  II 
substrates  were  selected  least  frequently  and  contain  only 
three  members,  making  it  difficult  to  identify  subsite  prefer- 


20577 


Substrate  Recognition  by  Matrix  Metalloproteinase-9 


Table  HI 

A  comparison  of  the  MMP-9  substrate  consensus  with  cleavage  sites  in 
known  protein  substrates 

The  group  I  MMP-9  consensus  motif  is  compared  to  the  cleavage  sites 
in  proteins  known  to  be  cleaved  by  MMP-9.  The  protein  substrates  are 
divided  into  two  groups:  collagen  and  non-collagen.  The  residues  in  the 
protein  substrates  that  match  the  MMP-9  consensus  are  shown  in  bold. 
Scissile  bonds  are  identified  by  j  . 
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ences  outside  of  the  Gly-Leu-(Lys/Arg)  consensus  motif  that 
occupies  the  P2,  Pr,  and  P2,  positions.  Group  III  represents  a 
novel  substrate  preference  for  MMP-9.  As  a  whole,  group  III 
substrates  were  cleaved  far  less  efficiently  than  either  group  I 
or  group  II.  Nevertheless,  in  the  context  of  the  whole  phage,  at 
least  one  of  the  phages  in  this  family  (Cll)  was  cleaved  nearly 
as  well  as  most  of  the  group  I  substrates.  An  overriding  feature 
of  the  group  III  substrates  is  the  presence  of  Arg  at  P2  and  often 
at  Px.  Although  this  recognition  specificity  is  unexpected,  it  is 
corroborated  by  three  other  pieces  of  data.  First,  eight  sub¬ 
strates  from  group  I,  including  the  four  best  peptides  in  this 
study,  also  have  Arg  at  the  P2  position.  Second,  when  peptide 
All  from  group  I  was  mutated  to  contain  Arg  at  both  P2  and  Px, 
the  JKm  ratio  for  the  peptide  nearly  tripled.  Third,  docking 
studies  with  a  representative  group  III  peptide  illustrate  the 
favorable  interaction  of  Arg  with  the  S2  subsite  (Fig.  1). 

A  walk  across  the  model  of  the  catalytic  pocket  of  MMP-9, 
subsite  by  subsite,  illustrates  many  of  the  features  that  are 
likely  to  guide  substrate  interactions,  and  confer  recognition 
specificity.  On  the  non-primed  sides  of  the  scissile  bond,  the 
structural  features  of  the  S3  and  S2  subsites  are  consistent 
with  the  corresponding  substrates  that  we  selected.  As  with 
most  MMPs,  the  S3  subsite  is  a  hydrophobic  pocket  that  intro¬ 
duces  a  deviation  from  linearity  into  the  active-site  cleft.  This 
deviation  explains  why  Pro  is  favored  at  position  P3.  At  S2,  one 
finds  the  side  chain  of  Asp-410  contributing  its  hydrophilic 
characteristics,  and  providing  a  potential  salt  bridge  for  the 
guanidinium  group  of  Arg  that  is  often  found  at  P2.  MMP-13 
also  has  an  Asp  residue  at  this  position,  and  Arg  occasionally 
occupies  the  P2  positions  of  its  substrates.  In  contrast,  however, 
MMP-7  displays  an  Ala  at  this  site  rather  than  an  Arg  (3,  42), 
and  none  of  the  MMP-7  peptide  substrates  contain  Arg  at  P2  (32). 

Moving  across  the  cleft,  one  encounters  the  S1  and  Sr  pock¬ 
ets.  The  Sx  space  of  MMP-9  is  an  extended  hydrophobic  surface 
that  can  accommodate  a  wide  variety  of  amino  acids  (41).  In 
fact,  this  is  what  we  find.  Lake  other  MMPs,  the  Sr  subsite  is 
a  deep  hydrophobic  pocket,  and  it  is  likely  to  be  the  most 
important  substrate  recognition  point  in  the  active  site.  This 
notion  is  supported  by  the  fact  that  Leu  and  He  residues  dom¬ 
inate  the  Pr  position.  As  discussed  previously,  the  depth  of  this 
subsite  also  guides  the  binding  of  most  of  the  small  molecule 
antagonists  of  the  MMPs  (1,  3,  42). 

The  S2.  subsite,  a  small  hydrophobic  surface,  is  the  final 


Table  IV 

Identification  of  potential  MMP-9  substrates 
The  group  I  MMP-9  consensus  motif  was  used  to  query  the 
SWISSPROT  data  base  to  identify  potential  physiological  substrates  of 
MMP-9.  The  search  was  limited  to  proteins  that  contained  these  poten¬ 
tial  MMP-9  cleavage  sites.  The  putative  protein  substrates  and  poten¬ 
tial  cleavage  sites  are  shown. 


consensus:  R  S/T  Hy  S/T 

Potential  protein  substrate 
(residues) 

P  R 

T 

I 

T 

Kallikrein  14  (180-184) 

P  R 

T 

I 

S 

Ladinin  1  (371-375) 

P  R 

T 

V 

T 

Endoglin  (231-235) 

P  R 

T 

I 

S 

Endothilin  receptor  (82-86) 

P  R 

S 

L 

T 

Laminin  a3  chain  (1568-1572) 

P  R 

s 

L 

S 

Phosphate  regulating  neutral 

endopeptidase  (566-570) 

P  R 

T 

I 

S 

ADAM  2  (299-302) 

P  R 

S 

L 

T 
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Integrin  05  (57-61) 

significant  recognition  point  for  substrate  in  the  cleft  of 
MMP-9.  The  S2,  subsite  provides  opportunity  for  flexible  amino 
acids  to  adapt  conformations  that  would  allow  interactions 
with  the  solvent  on  substrate  binding  at  this  position,  including 
amino  acids  with  small  hydrophobic  or  large  hydrophilic  side 
chains.  We  find  that  Ser/Thr  are  preferred  at  this  position. 
Even  though  two-thirds  of  the  MMP-9  substrates  derive  their 
P2,  position  from  an  invariant  Thr  encoded  by  the  vector,  the 
remaining  one-third  derive  Thr  or  Ser  at  their  P2.  positions  by 
selection.  Interestingly,  the  character  of  the  S2-  subsite  differs 
among  MMP-9,  MMP-7,  and  MMP-13.  In  MMP-9  this  subsite 
essentially  comprises  Asp-188,  Gly-189,  Leu-190,  and  Met-422 
(D-G-L-M)  (3,  42).  The  corresponding  residues  in  MMP-7  are 
G-N-T-T,  and  in  MMP-13  are  S-G-L-I.  These  distinctions  are 
consistent  with  the  fact  that  the  substrates  for  each  protease 
also  differ  at  P2,  (see  above). 

Given  this  new  understanding  of  substrate  recognition  by 
MMP-9,  one  can  arrive  at  an  optimal  substrate  consensus 
(Pro-Arg-(Ser/Thr)- J  -(Leu/He)-(Ser/Thr)).  A  comparison  of  this 
motif  to  previously  reported  substrates  for  MMP-9  reveals 
some  interesting  similarities  and  distinctions  (Table  III)*  For 
example,  all  of  the  collagen  substrates  and  even  aggrecan, 
tissue  factor  pathway  inhibitor  and  galectin  3  contain  a  Pro  at 
the  P3  position  and  a  hydrophobic  amino  acid  at  the  Pj»  position 
(6,  48,  50-52).  However,  none  of  the  substrates  contain  resi¬ 
dues  found  to  be  optimal  at  other  positions.  The  remaining 
substrates  have  significantly  lower  homology  to  the  MMP-9 
consensus  recognition  motif  of  group  I  (8,  53). 

These  comparisons  raise  questions  as  to  whether  all  of  the 
physiologically  relevant  substrates  for  MMP-9  have  been  iden¬ 
tified.  We  reasoned  that  a  query  of  the  protein  data  bases  with 
the  optimal  recognition  motif  might  reveal  a  short  list  of  puta¬ 
tive  substrates.  Indeed,  this  search  revealed  only  nine  human 
proteins  that  contain  this  sequence  within  their  extracellular 
domain  (Table  IV).  Although  at  this  juncture  these  proteins  can 
only  be  considered  hypothetical  substrates  for  MMP-9,  many  of 
them  are  functionally  linked  to  MMP-9 -related  pathology.  For 
example,  MMP-9  influences  skin  blistering  in  the  autoimmune 
disorder  bullous  pemphigoid  (5,  55).  Interestingly,  two  of  the 
hypothetical  substrates  for  MMP-9,  ladinin  1  and  desmoglein 
3,  are  also  associated  with  autoimmune  skin  blistering  disor¬ 
ders  (56,  57).  Similarly,  MMP-9  has  been  suggested  as  a  regu¬ 
lator  of  the  angiogenic  switch  in  tumor  development  (11).  Two 
of  the  hypothetical  substrates  for  MMP-9,  integrin  05  and  en- 
doglin  1,  are  also  involved  in  angiogenesis  (49,  54,  58).  Along 
with  the  presence  of  a  potential  MMP-9  cleavage  site,  these 
functional  associations  lead  to  important  and  testable  hypoth¬ 
eses  about  physiologic  substrates  for  MMP-9. 
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It  is  estimated  that  proteases  comprise  nearly  2%  of 
the  human  genome.  Given  that  the  primary  structure 
of  all  known  proteases  will  soon  be  available,  an  im¬ 
portant  challenge  is  to  define  the  structure-activity 
relationships  that  govern  substrate  hydrolysis.  Ideally 
this  would  be  accomplished  on  a  genome-wide  scale. 
To  this  end,  we  have  developed  a  one-pot  phage  selec¬ 
tion  system  that  yields  the  substrate  recognition  pro¬ 
file  of  multiple  proteases  from  a  single  round  of  selec¬ 
tion.  The  system  meets  five  key  criteria:  (i)  multiple 
proteases  can  be  analyzed  simultaneously,  (ii)  prior 
knowledge  of  substrate  preference  is  not  required,  (iii) 
information  regarding  substrate  preferences  on  both 
side  of  the  scissile  bond  is  obtained,  (iv)  the  system 
yields  selective  substrates  that  distinguish  closely  re¬ 
lated  proteases,  and  (v)  semiquantitative  information 
on  substrate  hydrolysis  is  obtained,  allowing  for  the 
assignment  of  initial  rank-order  preferences.  As  an 
illustration,  a  phage  selection  with  a  mixture  of 
thrombin  and  factor  Xa  (serine  proteases)  along  with 
matrix-metalloproteinase-9  and  atrolysin  C  (metallo- 
proteinases)  was  performed.  Peptide  substrates  were 
identified  that  (i)  have  high  kcJKm  ratios,  (ii)  are  se¬ 
lective  for  individual  proteases,  and  (iii)  match  the 
sequences  of  known  physiological  substrates.  «  2001 

Academic  Press 


Proteases  are  key  regulators  of  a  wide  range  of  phys¬ 
iologic  processes  (1,2),  and  are  recognized  as  important 
and  tractable  drug  targets.  A  primary  aspect  of  pro¬ 
tease  function  is  substrate  recognition  and  hydrolysis, 
and  information  on  substrate  recognition  is  an  excel¬ 
lent  starting  point  for  elucidating  a  protease’s  biologi¬ 
cal  function.  Such  information  is  key  for  three  reasons, 
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(i)  it  provides  structure-activity  relationships  that 
guide  our  fundamental  understanding  of  catalysis,  (ii) 
it  can  be  used  to  interrogate  gene  and  protein  data¬ 
bases  to  identify  putative  physiologic  substrates,  and 
(iii)  it  can  also  guide  the  synthesis  of  pharmaceutical 
inhibitors.  Traditional  methods  for  establishing  sub¬ 
strate  recognition  profiles  involve  the  identification  of 
the  scissile  bond  within  a  given  protein  substrate. 
Then,  synthetic  peptide  variants  of  this  sequence  can 
be  screened  in  an  iterative  fashion  to  arrive  at  more 
optimized  substrates.  Even  though  it  can  be  fruitful, 
this  iterative  strategy  is  biased  toward  the  original 
substrate  sequence  and  is  also  tremendously  cumber¬ 
some.  Furthermore,  it  is  not  amenable  to  high- 
throughput  analysis. 

Given  that  the  first  draft  of  the  human  genome  is 
now  complete  (3,  4),  the  sequences  of  all  proteases  are 
essentially  known.  Current  estimates  indicate  that 
proteases  account  for  2%  of  the  human  genome  (5). 
Since  this  equates  to  less  than  1000  genes,  the  creation 
of  a  library  of  all  human  proteases  in  recombinant 
expression  systems  and/or  in  purified  form  is  well 
within  current  technological  capabilities.  Do  we  bio¬ 
chemically  analyze  each  protease  one  at  a  time  to  de¬ 
rive  function?  Or,  is  there  a  functional  genomics  strat¬ 
egy  that  can  be  applied  to  rapidly  characterize  the 
substrate  recognition  profile  of  all  proteases? 

We  reasoned  that  a  phage  display-based  approach 
could  be  modified  to  characterize  the  substrate  recog¬ 
nition  profile  of  all  proteases.  Substrate  phage  display 
has  already  been  employed  to  characterize  the  sub¬ 
strate  preferences  of  a  number  of  individual  proteases 
(6-11).  Phage  display  is  advantageous  for  several  rea¬ 
sons.  First,  libraries  containing  all  possible  combina¬ 
tions  of  a  random  hexamers/heptamers  can  be  con¬ 
structed.  Second,  substrates  displayed  on  phage  allow 
one  to  gain  information  on  substrate  preference  on 
both  sides  of  a  scissile  bond.  This  is  not  possible  with 
synthetic  libraries  of  fluorogenic  substrates.  Lastly, 
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phage  display  precludes  the  identification  of  pro¬ 
tein  substrates  that  are  then  analyzed  in  an  iterative 
fashion. 

Phage  display,  when  used  in  the  traditional  manner, 
is  a  heritable  system,  allowing  one  to  evolve  optimal 
substrates  (or  binders)  through  multiple  rounds  of  se¬ 
lective  pressure.  It  is  this  property,  however,  that  typ¬ 
ically  prevents  phage  from  being  used  to  obtain  sub¬ 
strate  recognition  profiles  for  multiple  proteases  at 
once.  Using  conventional  phage  selections,  a  few  phage 
substrates  that  best  meet  the  selection  criteria  ulti¬ 
mately  dominate  the  phage  population.  Consequently, 
a  conventional  selection  against  a  mixture  of  proteases 
would  yield  a  small  subset  of  substrates  that  are 
cleaved  by  only  one  or  a  few  of  the  proteases  within  the 
mixture.  Information  on  substrate  recognition  by  other 
proteases  in  the  mixture,  or  on  rare  substrates,  will  be 
lost  as  a  consequence  of  enrichment. 

Here  we  put  forth  a  strategy  for  obtaining  the  sub¬ 
strate  recognition  profile  of  multiple  proteases,  simul¬ 
taneously.  It  couples  one  round  of  substrate  phage 
display  with  a  substrate  ELISA2  to  deconvolute  the 
specificity  of  protease  substrates.  The  system  allows 
one  to  avoid  problems  caused  by  the  heritable  nature  of 
phage  display  by  activity  profiling  even  the  rarest  of 
substrates  after  a  single  round  of  protease  selection. 
Furthermore,  it  allows  all  possible  substrates  to  be 
tested  for  any  protease,  individually,  allowing  one  to 
compare  and  deconvolute  the  substrate  preferences  for 
large  panels  of  proteases  in  single  experiments.  This 
report  describes  the  ELISA-based  analysis  of  phage 
from  one  round  of  selection,  in  a  semiquantitative  fash¬ 
ion,  by  four  proteases. 

MATERIALS  AND  METHODS 

Restriction  enzymes  were  purchased  from  New  En¬ 
gland  Bio-Labs  and  Roche  Molecular  Biosciences. 
Thrombin  and  factor  Xa  were  from  Calbiochem.  Oligo¬ 
nucleotides  were  synthesized  by  IDT  Inc.  (Iowa).  Pep¬ 
tides  were  synthesized  by  Anaspec  Inc.  (San  Jose,  Ca.). 
All  chemical  and  buffers  were  from  Sigma  and  Fisher. 
The  catalytic  domain  of  MMP-9  was  expressed  and 
purified  as  described  previously  (12).  Atrolysin  C  was 
provided  by  Dr.  Jay.  W.  Fox  (University  of  Virginia). 

Construction  of  phage  library.  The  substrate  phage 
library  was  generated  using  a  modified  version  of  the 
fUSE5  phagemid  (13).  A  FLAG  epitope  was  engineered 
at  the  N-terminus  of  the  genelll  protein  by  annealing 
oligonucleotides  5'-CCGGGTTTGJCGTCGTCGTCTT- 
TGTAGTCGGTAC-3 '  and  5'-CGACTACAAAGAC- 
G ACG ACGAC AAAC-3 '  and  ligating  them  into  fUSE5 


2  Abbreviations  used:  ELISA,  enzyme-linked  immunosorbent  as¬ 
say;  MALDI-TOF,  matrix-assisted  laser  desorption  ionization-time 
of  flight;  MM9,  matrix  metalloproteinase-9. 


at  the  Kpnl  and  Xbal  restriction  sites.  The  random 
hexamers  were  generated  by  PCR  extension  of  the 
template  oligonucleotide  S'-GGGGAGGCCGACGTG- 
GCCGTC  ATC  AGGCGGCTC  AGGC-  (NNK)  6  ACGGCC- 
TCTGGGGCCGAAAC-3',  where  N  is  any  nucleotide 
and  K  is  either  G  or  T.  The  template  oligonucleotide 
also  encodes  a  peptide  linker  (SGGSG)  positioned  in 
between  the  FLAG  epitope  and  the  random  hexamer. 
An  extension  oligonucleotide  5'-AATTTCTAGTTTCG- 
GCCCCAGAGGC-3 '  and  the  template  oligonucleotide 
were  mixed  and  heated  at  65°C  for  2  min.  The  heating 
block  was  allowed  to  cool  passively  to  40°C  to  facilitate 
annealing  of  the  extension  oligonucleotide  to  the  tem¬ 
plate  oligonucleotide.  Elongation  of  the  template  oligo¬ 
nucleotide  was  performed  using  Sequenase  (USB)  (14). 
The  final  cDNA  product  was  precipitated  with  ethanol, 
resuspended  in  water,  and  digested  with  Shi.  The  DNA 
insert  and  fUSE5  were  mixed  and  ligated  at  a  5:1 
molar  ratio  and  electroporated  into  Escherichia  coli 
MCI 061  (F-).  Several  phage  were  selected  for  sequenc¬ 
ing  to  confirm  the  random  nature  of  the  phage  insert. 

Selection  of  phage  substrates  by  protease  mixture . 
An  aliquot  of  the  substrate  phage  library  (2  X  10 10 
phage)  was  incubated  with  the  protease  mixture 
(thrombin,  factor  Xa,  MMP-9,  and  atrolysin  C,  at  2 
p-g/ml  each)  in  50  mM  Tris,  pH  7.4,  100  mM  NaCl,  10 
mM  CaCl2,  0.05%  Brij-35,  and  0.05%  BSA  for  1  h  at 
37°C.  A  control  selection  contained  no  protease.  The 
phage  that  were  cleaved  by  protease  were  separated 
from  the  noncleaved  phage  by  immuno-depletion  by 
the  addition  of  100  jug  of  Ml  anti-FLAG  monoclonal 
antibody  (Sigma)  followed  by  incubation  with  Pan¬ 
sorbin  (Calbiochem).  The  cleaved  phage  remaining  in 
the  supernatant  were  amplified  using  K91  E.  coli  as 
described  previously  (6-8,  15). 

Analysis  of  substrate  hydrolysis  by  phage  ELISA. 
Hydrolysis  of  individual  phage  substrates  by  each  pro¬ 
tease  was  measured  using  a  modified  ELISA.  The  wells 
of  a  96-well  microtiter  plate  (Titertek)  were  coated  with 
anti-M13  antibody  (Pharmacia,  2.5  pg/ml  in  PBS), 
overnight  at  4°C.  After  coating,  the  wells  were  blocked 
for  1  h  at  room  temperature  in  TBS-T  (50  mM  Tris,  pH 
7.8,  150  mM  NaCl,  0.2%  Tween  20)  containing  10 
mg/ml  BSA.  Phage  were  captured  from  150  /xl  of  over¬ 
night  phage  culture  supernatant  in  the  antibody- 
coated  wells  for  2  h  at  4°C.  The  unbound  phage  were 
removed  with  three  washes  of  ice-cold  TBS-T .  To  asses 
hydrolysis,  protease  (2  /xg/ml)  was  added  to  the  appro¬ 
priate  wells  in  Incubation  Buffer  (50  mM  Tris,  pH  7.5, 
100  mM  NaCl,  10  mM  CaCl2,  0.05%  BSA,  0.05%  Brij- 
35)  for  2  h  at  37°C.  Control  wells  lacked  protease.  The 
protease  solution  was  removed  and  the  wells  were 
washed  four  times  with  ice-cold  TBS-T.  To  measure 
protease  hydrolysis  of  the  peptides  on  phage,  anti- 
FLAG  polyclonal  antibody  was  added  to  each  well  and 
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FIG  1  Diagram  of  the  “one-pot”  protease  substrate  phage  selection  scheme.  Protease  substrates  are  selected  from  the  phage  library  by 
incubation  with  four  proteases,  simultaneously.  The  phage  cleaved  by  protease  within  the  random  hexamer  substrate  region  of  the  phage  are 
separated  from  the  uncleaved  phage  with  anti-FLAG  antibody  and  Pansorbin  precipitation.  Individual  phage  clones  representing  protease 
substrates  are  then  tested  in  a  phage  ELISA  by  each  protease,  independently,  to  identify  which  protease(s)  cleaves  which  phage  substrate.  Clear 
represents  positive  signal  and  solid  represents  negative  signal. 


the  plates  were  incubated  at  4°C  for  1  h.  Binding  of 
anti-FLAG  antibody  to  FLAG  epitope  was  measured 
with  HRP-conjugated  goat  anti-mouse  IgG  (Bio-Rad) 
followed  by  detection  at  490  nm.  The  extent  of  hydro¬ 
lysis,  taken  as  a  measure  of  substrate  efficiency,  was 
calculated  by  the  ratio  of  the  OD  at  490  nm  of  the 
protease-treated  samples  versus  matched  samples 
lacking  protease.  Each  clone  that  was  hydrolyzed  more 
than  50%  by  any  of  the  proteases  was  sequenced. 

Determination  ofscissile  bonds  using  synthetic  pep¬ 
tides.  The  scissile  bond  within  synthetic  peptide  sub¬ 
strates  was  determined  using  MALDI-TOF  mass  spec¬ 
trometry.  Each  peptide  (100  jllM)  was  incubated  with 
protease  in  50  mM  Tris,  pH  7.5,  100  mM  NaCl,  10  mM 
CaCl2  for  2  h  at  37°C.  The  mass  of  the  cleavage  products 
was  determined  using  a  Voyager  DE-RP  MALDI-TOF 
mass  spectrometer  (PerSeptive  Biosystems,  Framing¬ 
ham,  MA).  Following  hydrolysis,  the  peptide  samples 
were  prepared  according  to  methods  described  previously 
(16-18).  In  all  cases,  no  peaks  corresponding  to  alterna¬ 
tive  cleavage  products  were  observed.  Assignment  of  the 
scissile  bond  and  numbering  of  residues  on  either  side  of 
the  scissile  bond  were  performed  according  to  nomencla¬ 
ture  established  by  Schechter  and  Berger  (19). 

Kinetic  measurements  of  peptide  hydrolysis.  The  ki¬ 
netic  parameters  of  substrate  hydrolysis  were  mea¬ 
sured  using  a  fluorescamine  incorporation  assay  that 
has  been  described  previously  (20-22).  Briefly,  pro¬ 
tease  was  incubated  with  individual  peptide  substrates 
at  concentrations  ranging  from  100  to  800  pM  in  50 
mM  Tris,  pH  7.5,  100  mM  NaCl,  10  mM  CaCl2.  At 
selected  time  points,  reactions  were  stopped  by  the 
addition  of  1,10-phenanthroline.  Peptide  hydrolysis 
was  determined  by  the  addition  of  fluorescamine  fol¬ 
lowed  by  detection  at  Aex  355  nm  and  Aem  460  nm.  The 
data  were  transformed  to  double  reciprocal  plots  (l/[5j 
vs  1/V|)  to  determine  Km  and  kat.  For  some  substrates, 
Km  and  Acat  were  not  determined  individually,  but  the 
specificity  constant,  kJKm,  was  derived  by  the  equa¬ 
tion:  kJKm  =  vj  {E0)  (S0). 


RESULTS 

A  Phage  System  Designed  for  a  One-Pot  Selection 

Our  objective  was  to  create  a  phage  system  capable 
of  profiling  the  substrate  specificity  of  multiple  pro¬ 
teases  in  a  single  mixture.  The  crux  of  this  phage 
system  is  a  phagemid  that  was  constructed  from 
fUSE5,  a  vector  that  encodes  the  polyvalent  expression 
of  genelll  (13).  The  phagemid  is  modified  such  that  a 
random  hexamer  is  displayed  as  an  N-terminal  fusion 
with  the  genelll  protein.  In  addition,  the  phagemid 
was  engineered  to  encode  a  FLAG  epitope  linked  to  the 
random  hexamer  via  a  peptide  linker  (SGGSG).  The 
library  comprises  2.4  x  108  independent  transfor¬ 
mants,  assuring  a  75%  confidence  that  every  combina¬ 
tion  of  hexamer  is  represented  (data  not  shown). 
Twenty  clones  were  sequenced  to  verify  the  random 
nature  of  the  hexamer  insert.  Greater  than  95%  of  all 
uncleaved  phage  could  be  precipitated  from  solution 
with  an  anti-FLAG  antibody  (data  not  shown). 

We  reasoned  that  the  advantages  of  phage  display 
could  be  preserved,  and  the  disadvantages  overcome,  by 
coupling  a  substrate  phage  selection  to  a  semiquantita- 
tive  ELISA  (Fig.  1).  The  purpose  of  the  ELISA  is  twofold. 
First,  it  allows  the  assignment  of  individual  substrates  to 
their  cognate  protease.  Second,  the  ELISA  gives  a  first 
approximation  of  the  rate  of  hydrolysis  by  each  protease. 
This  allows  the  identification  of  optimal  substrates,  obvi¬ 
ating  the  need  for  further  rounds  of  selection. 

One-Pot  Selection  of  Substrates  with  Serine  and 

Metalloproteinases 

We  reasoned  that  the  substrate  preference  of  multi¬ 
ple  proteases  could  be  determined  from  a  one-pot  se¬ 
lection  provided  that:  (i)  only  one  round  of  selection 
was  used,  and  (ii)  accurate  measures  of  the  hydrolysis 
of  each  phage  clone  could  be  derived  from  a  separate 
assay.  To  test  the  one-pot  phage  system,  we  combined 
two  serine  proteases,  thrombin  and  factor  Xa,  with 
two  metalloproteinases,  MMP-9  and  atrolysin  C. 
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The  hexamer  library  described  above  (2  X  10 10  phage) 
was  incubated  with  an  equimolar  amount  of  the 
protease  mixture  for  1.5  h  at  37°C.  An  anti-FLAG 
antibody  was  used  to  separate  the  cleaved  and  non- 
cleaved  phage  by  precipitation  with  Pansorbin. 
Cleaved  phage  were  recovered  from  the  supernatant 
and  used  to  infect  K91  bacteria.  Phage  clones  were 
selected  randomly  from  this  pool  for  further  character¬ 
ization.  The  enrichment  from  one  round  of  treatment 
was  fivefold. 

To  assess  the  hydrolysis  of  individual  clones,  phage 
were  captured  from  bacterial  supernatants  into  micro¬ 
titer  plates  using  an  anti-M13  antibody.  The  immobi¬ 
lized  phage  were  treated  with  each  protease  and  the 
extent  of  hydrolysis  was  measured  by  the  release  of  the 
FLAG  epitope.  Results  from  the  average  of  two  such 
ELISAs  are  shown  in  Table  1.  The  extent  of  hydrolysis 
of  each  phage  clone  is  expressed  as  the  percentage  of 
FLAG  epitope  removed  by  protease  treatment.  Approx¬ 
imately  17%  of  phage  tested  were  scored  as  nonsub¬ 
strates  (<20%  hydrolysis),  as  expected  from  the  one 
round  of  enrichment.  From  this  table,  the  preferred 
substrates  of  each  protease  are  clearly  evident.  Clones 
81  and  89  are  highly  specific  substrates  for  thrombin, 
whereas  clones  74  and  86  exhibit  similar  selectivity  for 
MMP-9.  Substrates  selective  for  factor  Xa  and  atroly- 
sin  C  are  also  evident.  Some  phage  are  cleaved  by 
multiple  proteases.  For  example,  phage  1  and  17, 
which  both  contain  an  Arg-Ser  bond,  are  cleaved  by 
thrombin  and  factor  Xa.  This  overlapping  recognition 
is  not  surprising  given  the  structural  homology  be¬ 
tween  the  two  proteases. 

From  the  results  shown  in  Table  1,  groups  of  pre¬ 
ferred  substrates  can  be  visualized  for  each  protease. 
By  aligning  the  substrates  cleaved  more  than  60%  by 
thrombin,  two  groups  of  substrates  become  apparent 
(Table  2).  The  consensus  motif  derived  from  a  combi¬ 
nation  of  the  groups,  (Gly/Pro) - (Arg/Lys) - i -Ser- (Trp/ 
Phe),  is  in  agreement  with  the  known  substrate  pref¬ 
erence  of  thrombin  and  matches  the  sequence 
surrounding  the  scissile  bond  in  several  physiologic 
substrates  for  thrombin,  including  the  thrombin  recep¬ 
tor  (protease-activated  receptor),  factor  V  and  factor 
VII  (23-25).  Similarly,  phage  substrates  identified  for 
factor  Xa  contain  a  distinct  recognition  motif  (Gly- 
Arg-l-X)  that  matches  well  with  the  prothrombin  ac¬ 
tivation  site,  the  main  physiologic  target  of  factor  Xa 
(Table  2)  (26). 

Comparison  of  One-Pot  Selection  to  Conventional 

Substrate  Phage  Selection 

Since  phage  selections  are  traditionally  performed 
by  three  rounds  of  enrichment,  we  compared  the  one- 
pot  approach  to  a  conventional  substrate  phage  selec¬ 
tion.  A  group  of  phage  from  the  one-pot  selection,  that 


were  hydrolyzed  at  least  55%  by  MMP-9,  were  com¬ 
pared  to  phage  we  selected  through  three  rounds  of 
hydrolysis  by  MMP-9.  As  shown  in  Table  3,  both  meth¬ 
ods  yielded  substrates  with  a  consensus  of  Pro-X-X- 
XHy,  where  X  is  any  amino  acid  and  XHy  is  a  hydropho¬ 
bic  amino  acid.  This  motif  is  nearly  identical  to  a  motif 
that  is  reported  to  represent  the  optimal  cleavage  se¬ 
quence  of  a  number  of  other  metalloproteinases  (7 ,  20, 
21).  These  observations  show  that  a  single  round  of 
selection  is  capable  of  yielding  excellent  substrates 
that  match  well  with  those  obtained  from  conventional 
selections. 

The  Substrate  ELISA  as  a  Semiquantitative  Indicator 

of  Substrate  Hydrolysis 

Experiments  were  conducted  to  determine  how  well 
the  substrate  phage  ELISA  predicts  the  rank-order 
preference  of  analogous  synthetic  peptides.  The  hydro¬ 
lysis  of  a  series  of  phage  substrates  for  MMP-9  was 
measured  as  a  function  of  time  (Fig.  2A)  and  compared 
to  the  kJKm  ratio  for  the  corresponding  synthetic 
peptides  (Fig.  2B).  Phage  A6  was  hydrolyzed  by  85%  at 
2  h  and  the  matching  peptide  had  a  k^J Km  ratio  of 
61,000  M"1  s'1.  Phage  A7  was  hydrolyzed  to  a  lesser 
degree,  50%  at  2  h.  Correspondingly,  the  A7  peptide 
had  a  kJKm  ratio  of  12,000  M'1  s"1.  The  worst  sub¬ 
strate,  phage  A16,  was  hydrolyzed  by  only  20%  at  2  h, 
and  its  peptide  exhibited  the  lowest  /rcat/Am  ratio,  only 
1100  M"1  s"1.  Both  methods  then  yield  the  same  rank 
order  of  preference  of  substrates,  A6  >  A7  >  A16. 
Similar  observations  were  made  when  phage  sub¬ 
strates  for  thrombin  (Fig.  2C)  were  compared  to  the 
analogous  synthetic  peptides  (Fig.  2D).  Clearly  the  use 
of  synthetic  peptide  substrates  provides  a  more  precise 
measure  of  Km  and  kcat,  but  the  ELISA  is  sufficient  to 
indicate  a  rank  order  of  preference  and  a  greater  num¬ 
ber  of  substrates  can  be  explored  quickly.  Our  experi¬ 
ments  indicate  that  the  substrate  phage  ELISA  can 
distinguish  substrates  across  a  range  of  kcJ Km  extend¬ 
ing  from  approximately  1000  to  60,000  M'1  s  ’.  Beyond 
these  lower  and  upper  limits,  distinction  is  more  diffi¬ 
cult,  but  possible.  It  is  likely,  however,  that  manipula¬ 
tion  of  the  ELISA  conditions  could  alter  the  dynamic 
range  of  detection. 

DISCUSSION 

Two  decades  ago,  the  Holy  Grail  of  biology  appeared 
to  be  the  knowledge  of  the  sequence  of  the  entire  hu¬ 
man  genome.  As  the  prospect  of  success  in  this  en¬ 
deavor  increased,  emphasis  shifted  toward  knowing,  or 
predicting,  the  structure  of  all  human  proteins  (27-30). 
Now,  even  these  genome-wide  structural  predictions 
appear  within  the  realm  of  possibility  (31).  Conse¬ 
quently,  emphasis  should  now  shift  toward  large-scale 
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TABLE  1 

Substrates  Obtained  from  a  “One-Pot"  Selection 


Protease  Clone 
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Note .  The  substrate  phage  library  was  incubated  with  a  mixture  of  thrombin  (Th),  factor  Xa  (Xa),  MMP-9 
(M9),  and  atrolysin  C  (At).  Phage  substrates  were  selected  as  described  under  Materials  and  Methods. 
Individual  phage  clones  were  tested  as  substrates  in  an  ELISA  assay.  The  extent  of  hydrolysis  is  expressed 
as  a  percentage  of  total  relative  to  untreated  controls,  and  is  the  average  of  two  independent  experiments. 
The  hexamer  sequence  displayed  by  individual  phages  was  derived  from  the  DNA  sequence  of  the ;  phage 
inserts.  Standard  error  between  duplicate  experiments  was  15%  or  less  of  the  measured  values.  Shaded 
clones  are  referred  to  in  text,  and  nonsubstrates  are  outlined. 
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TABLE  2 


Comparison  of  Phage  Substrates  to  Known 
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Note.  Several  of  the  optimum  substrates  for  thrombin  and  factor  Xa 
were  aligned  by  sequence  identity.  The  position  of  the  scissile  bond  was 
then  surmised  based  upon  the  hydrolysis  of  representative  synthetic  pep¬ 
tides.  From  this  alignment,  general  consensus  motif  was  derived  by  visual 
inspection,  which  in  turn  was  compared  to  the  amino  acid  sequence  within 
known  physiologic  substrates  for  each  protease. 


measures  and  predictions  of  function.  Here,  we  set 
forth  a  strategy  to  profile  the  catalytic  space  of  large 
sets  of  proteases.  In  principle,  this  strategy  makes  it 
possible  to  define,  and  directly  compare,  the  substrate 
recognition  profile  of  nearly  all  c/jc/o- peptidases. 

Four  proteases  representing  two  catalytic  classes 
were  used  to  test  the  system’s  ability  to  profile  catalytic 
space.  Within  this  set,  we  included  two  classical  serine 
proteases,  thrombin  and  factor  Xa,  for  which  the  sub¬ 
strate  specificity  has  been  well  characterized.  We  rea¬ 
soned  that  a  test  of  the  system  with  “knowns”  would 
indicate  its  ability  to  properly  report  substrate  recog¬ 
nition.  MMP-9  (matrix  metalloproteinase)  and  atroly- 
sin  C  (snake  venom  metalloproteinase)  were  used  as 
representatives  of  metalloproteinases,  whose  recogni¬ 
tion  profiles  are  not  well  established  (5).  Within  each 
class,  the  proteases  share  structural  similarity  and, 
reportedly,  have  overlapping  substrate  preferences.  A 
comparison  of  the  substrates  selected  for  each  protease 
to  the  reported  substrate  preference  of  each  enzyme 
demonstrates  the  power  of  the  approach. 


A  semiquantitative  readout  of  phage  hydrolysis  is 
required  to  establish  rank-order  preferences  among 
substrates  for  a  single  protease.  Such  information  is 
also  important  for  comparing  hydrolysis  of  a  single 
phage  by  more  than  one  protease.  To  date,  the  analysis 
of  phage  substrates  has  been  limited  to  cumbersome 
Western  blots  of  protease-treated  phage,  or  analysis  of 
synthetic  peptide  corresponding  to  individual  phage 
clones.  A  modified  ELISA  was  devised  to  measure  the 
hydrolysis  substrates  on  phage.  The  substrate  phage 
ELISA  allowed  for  the  identification  of  substrates  for 
each  protease  (Table  1).  In  addition,  when  equal  pro¬ 
tease  concentrations  are  used  to  cleave  phage  sub¬ 
strates,  a  rank-order  preference  for  each  substrate  by 
its  cognate  protease  can  be  assigned.  The  trends  of 
phage  hydrolysis  observed  in  the  ELISA  correlate  well 
with  the  kcJKm  ratios  of  the  corresponding  synthetic 
peptides  (Fig.  2).  The  semiquantitative  ELISA  will  dis¬ 
cern  substrates  whose  k,JKm  ratios  differ  by  less  than 
an  order  of  magnitude.  In  addition,  the  ELISA  can  be 
modified  into  a  time  course  or  a  protease  dose-response 
curve  (data  not  shown)  to  further  analyze  substrates  of 
interest.  Therefore,  the  scheme  described  in  this  study 
can  establish  accurate  first  approximations  of  sub¬ 
strate  preferences  between  related  proteases. 

Given  its  importance  to  hemostasis,  considerable  ef¬ 
fort  has  been  devoted  to  defining  the  substrate  recog¬ 
nition  properties  of  thrombin  (25,  32-35).  Therefore, 
we  felt  it  served  as  an  excellent  example  as  a  “known” 
test  case.  Thrombin  preferentially  cleaves  substrates 
with  a  sequence  of  Gly-Pro-Arg-  i  -Ser-Phe-Xcharged.  Us¬ 
ing  the  one-pot  approach,  we  have  identified  two 
groups  of  substrates  that  generally  match  this  struc¬ 
tural  preference  (Table  2).  In  fact,  a  consensus  motif 
derived  from  the  two  groups  of  substrates  matches 
many  of  thrombin’s  physiologic  substrates.  Thrombin 
is  somewhat  unique  in  that  it  has  a  strict  substrate 


TABLE  3 

Comparison  of  MMP-9  Substrates  Obtained  from  “One-Pot” 
and  Conventional  Selections- 


“One-pot”  selection  Conventional  selection 
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Note.  Phage  substrates  for  MMP-9  obtained  from  the  “one-pot”  selection 
were  compared  to  substrates  selected  from  the  same  library  using  three 
rounds  of  enrichment.  Sequences  were  aligned  by  inspection,  based  pri¬ 
marily  on  the  P-X-X-Hy  motif  that  has  been  reported  as  a  common  recog¬ 
nition  motif  for  metalloprot eases.  Identical  and  conserved  residues  are  in 
bold  text.  Hy;  hydrophobic  residues,  Po,  polar  residues. 
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FIG.  2.  Comparative  analysis  of  substrate  phage  ELISA  vs  synthetic  peptide  hydrolysis.  A  comparison  of  relative  rates  of  ^rolysis  of 
substrate  peptides  on  phage  with  the  hydrolysis  of  the  corresponding  synthetic  peptides  was  performed.  Phage  clones  from  a  conventiona 
selection  with  MMP-9,  A6  (•).  A7  P).  and  A16  (A)  were  cleaved  by  MMP-9  (A)  and  phage  clones  17  (•)  and  1  p)  from  the  one-pot  sys  em 
were  cleaved  by  thrombin  (C)  as  described  under  Materials  and  Methods.  The  rates  of  hydrolysis  of  the  time  course  ELISA  (A  and  ) 
the  A:  IK  of  the  corresponding  synthetic  peptides  (B.  MMP-9;  and  D.  thrombin)  were  analyzed  to  arrive  at  a  quantitative  comparison.  Each 
substrateELISA  was  performed  See,  and  the  standard  error  ranged  from  5  to  20%  between  experiments.  Each  kJKm  value  was  measured 
in  three  separate  experiments  with  standard  deviations  ranging  from  7  to  17%. 


preference  across  multiple  substrate  subsites.  How¬ 
ever,  our  observations  illustrate  that  the  one-pot  selec¬ 
tion  strategy  provides  the  same  type  of  information 
that  has  been  culled  from  the  analysis  of  thrombin’s 
individual  substrates  and  from  profiling  with  peptide 
libraries,  even  with  the  background  of  three  other  pro¬ 
teases. 

We  also  included  factor  Xa  in  the  selection  because  of 
its  close  relationship  to  thrombin.  Factor  Xa  is  similar 
to  thrombin  in  both  fold  and  substrate  preference  (5, 
26,  33).  Perhaps  not  surprisingly,  we  identified  a  num¬ 
ber  of  substrates  that  are  cleaved  by  both  factor  Xa  and 
thrombin.  Similar  to  the  Group  I  thrombin  substrates, 
most  of  the  substrates  selected  by  factor  Xa  contain 
Gly-Arg  at  P2  and  P,.  Although  the  number  of  “good” 
substrates  identified  for  factor  Xa  was  small,  the  resi¬ 
dues  at  P  i'  appear  to  be  more  variable  than  for  throm¬ 
bin.  This  finding  is  consistent  with  the  fact  that,  in  four 


physiologic  substrates  of  factor  Xa,  there  are  three 
different  Pj<  residues  (26). 

Even  though  thrombin  and  factor  Xa  have  a  similar 
substrate  recognition  profile,  the  one-pot  selection  re¬ 
vealed  substrates  that  distinguish  the  two  proteases. 
For  example,  substrates  81  and  89  are  completely  hy¬ 
drolyzed  by  thrombin,  but  factor  Xa  has  little  effect  on 
either.  A  further  illustration  of  this  ability  is  provided 
by  a  comparison  of  the  substrates  selected  by  MMP-9 
vs  atrolysin  C.  Although  atrolysin  C  and  MMP-9  are 
structurally  related,  phage  substrates  selective  for 
each  of  the  two  proteases  were  obtained  from  the  one- 
pot  selection.  Phage  74  and  86  are  cleaved  by  MMP-9, 
but  not  by  atrolysin.  Conversely,  phage  6  and  35  are 
cleaved  by  atrolysin  but  not  MMP-9.  The  ability  to 
identify  substrates  that  distinguish  closely  related  pro¬ 
teases  is  of  great  significance,  especially  if  the  strategy 
is  to  be  applied  to  large  groups  of  related  proteases. 


8 


KRIDEL,  CHEN,  AND  SMITH 


Prior  work  with  substrate  phage  has  employed  sub¬ 
traction  protocols  to  obtain  peptides  that  distinguish 
two  closely  related  proteases  (15).  Yet,  to  obtain  sub¬ 
strates  selective  for  one  protease  over  four  or  five  re¬ 
lated  proteases,  one  would  have  to  employ  a  matrix  of 
subtractions.  Therefore,  the  manipulations  involved  in 
subtractive  substrate  phage  display  are  too  compli¬ 
cated  and  involved  to  apply  to  a  large  set  of  proteases. 
The  fact  that  the  one-pot  approach  reveals  substrates 
that  are  selective  for  related  proteases,  and  can  be 
applied  to  multiple  proteases  at  once,  indicates  that  it 
would  be  of  great  value  in  obtaining  a  panel  of  sub¬ 
strates  that  distinguish  proteases  in  one  family.  In  this 
regard,  the  one-pot  strategy  is  unique. 

We  also  compared  the  one-pot  selection  approach  to 
a  conventional  substrate  phage  selection  involving 
multiple  rounds  of  selection.  Using  a  conventional  se¬ 
lection  with  MMP-9,  we  identified  substrates  with  a 
consensus  of  Pro-Arg-Ser/Thr-  [  -XHy-Ser/Thr,  where  X 
is  any  amino  acid  and  Xny  is  a  hydrophobic  amino  acid 
(preferably  Leu  or  lie)  (1 1).  This  is  a  motif  that  appears 
to  be  a  general  recognition  motif  for  MMPs  (7,  20,  21, 
36).  Using  the  one-pot  scheme,  a  similar  motif  was  also 
identified  for  MMP-9  (Table  3).  Equally  as  significant, 
the  consensus  we  observed  for  MMP-9  is  closely  related 
to  several  known  substrates  of  this  protease  (7,  37-39). 
In  addition,  none  of  these  substrates  were  cleaved  well 
by  atrolysin,  indicating  specificity  between  related  en¬ 
zymes. 

In  principle  one  could  profile  a  group  of  proteases 
individually  to  find  substrates  that  describe  the  sub¬ 
strate  preference  of  each  protease  and  compare  those 
substrates  with  each  protease  in  the  group.  The  one- 
pot  system  described  here  streamlines  this  process  in 
several  ways.  In  using  only  one  round  of  selection,  the 
number  of  phage  manipulations  is  greatly  reduced. 
Others  have  noted  that,  after  multiple  rounds  of  sub¬ 
strate  phage  selection,  phage  clones  that  are  resistant 
to  antibody  depletion  begin  to  dominate  a  population 
(7).  This  process  avoids  such  inherent  problems.  In 
addition,  this  method  reduces  the  number  of  individual 
experiments  required  to  obtain  the  same  data,  thereby 
reducing  the  amount  of  reagents  and  time  required  for 
data  acquisition.  Perhaps  most  significant  is  that  the 
ELISA-based  approach  gives  a  first  approximation  of  a 
substrates  efficiency  by  protease  for  individual  sub¬ 
strates. 

The  one-pot  system  does  have  limitations.  Even 
though  it  provides  information  on  recognition  specific¬ 
ity  on  both  sides  of  the  scissile  bond,  the  position  of 
that  bond  must  be  inferred  from  the  known  recognition 
specificity  of  the  enzyme,  or  determined  empirically 
with  synthetic  peptides,  as  done  in  this  study.  We 
envision  that  determination  of  the  scissile  bond  could 
be  streamlined  by  coupling  to  sensitive  mass  spectrom¬ 
etry  to  identify  the  mass  of  the  peptide  released  by 


protease.  In  combination  with  the  cDNA  sequence  of 
the  phage  insert,  this  measurement  could  rapidly  yield 
the  position  of  the  scissile  bond. 

Another  limitation  centers  on  the  quality  of  the  pro¬ 
tease  used  in  the  selection  and  ELISAs.  As  is  true  with 
any  enzyme,  precise  kinetic  measurements  can  only  be 
made  with  enzymes  in  which  the  active  site  has  been 
titrated.  In  this  study  we  were  unable  to  active-site 
titrate  factor  Xa.  This  inability  likely  explains  why 
fewer  substrates  were  obtained  for  factor  Xa.  However, 
this  limitation  does  not  preclude  the  identification  of 
substrates  that  are  of  great  value  in  further  analysis  of 
the  protease  functions.  Empirical  assessments  of  how 
many  proteases  could  be  profiled  simultaneously  will 
need  to  be  determined  as  well.  Despite  these  minor 
limitations,  we  envision  that  the  one-pot  approach  can 
be  applied  on  a  larger  scale,  with  greater  throughput, 
to  define  substrates  for  large  panels  of  proteases.  It  is 
also  conceivable  that,  with  modifications,  a  similar 
strategy  could  be  applied  to  screen  for  substrates  for 
other  enzymes,  like  kinases.  Likewise,  we  envision 
that  other  phage  display  systems,  including  Ml 3  and 
T7,  expressing  cDNA  libraries  could  be  modified  for  use 
in  this  type  of  system,  thus  allowing  for  a  more  rapid 
determination  of  potential  physiologic  substrates  for 
many  classes  of  enzymes. 
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Summary 


The  catalytic  domains  of  the  matrix  metalloproteinases  (MMPs)  are  structurally  homologous, 
raising  questions  as  to  the  degree  of  distinction,  or  overlap,  in  substrate  recognition.  The  primary 
objective  of  the  present  study  was  to  define  the  substrate  recognition  profile  of  MMP-2,  a 
protease  that  was  historically  referred  to  as  gelatinase  A.  By  cleaving  a  phage  peptide  library 
with  recombinant  MMP-2,  four  distinct  sets  of  substrates  were  identified.  The  first  set  is 
structurally  related  to  substrates  previously  reported  for  other  MMPs.  These  substrates  contain 
the  P-X-XJ'Xny  tXuY  =  Hvdmnhobic  residues!  consensus  motif  and  are  not  generally  selective  for 
MMP-2  over  the  other  MMPs  tested.  Two  other  groups  of  substrates  were  selected  from  the 
phage  library  with  similar  frequency.  Substrates  in  group  II  contain  the  L/I-X-XlXHy  consensus 
motif.  Substrates  in  group  HI  contain  a  consensus  motif  with  a  sequence  of  XHy-S-X^L,  and  the 
fourth  set  of  substrates  contain  the  H-X-X4xHy  sequence.  Substrates  in  Group  II,  HI  and  IV 
were  found  to  be  eight  to  almost  two  hundred-fold  more  selective  for  MMP-2  over  MMP-9.  To 
gain  an  understanding  of  the  structural  basis  for  substrate  selectivity,  individual  residues  within 
substrates  were  mutated,  revealing  that  the  P2  residue  is  a  key  element  in  conferring  selectivity. 
These  findings  indicate  that  MMP-2  and  MMP-9  exhibit  different  substrate  recognition  profiles, 
and  point  to  the  P2  subsite  as  a  primary  determinant  in  substrate  distinction. 
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INTRODUCTION 


The  matrix  metalloproteinases  (MMPs)  have  important  roles  in  a  number  of  normal  tissue 
remodeling  events  (1).  The  MMPs  are  also  of  interest  as  pharmaceutical  targets  because  of  their 
association  with  a  number  of  pathological  conditions  (2).  One  key  area  of  interest  is  tumor 
progression  and  tumor  angiogenesis.  Two  closely  related  MMPs,  MMP-2  and  MMP-9,  have 
been  strongly  linked  to  these  events.  For  example,  MMP-2  is  up-regulated  in  invasive  tumors, 
where  it  has  been  suggested  to  enact  matrix  degradation  during  tumor  growth  and  invasion  (3-5). 
Other  reports  link  MMP-2  with  the  avP3  integrin  in  the  process  of  tumor  angiogenesis  (6).  The 
fact  that  mice  lacking  the  gene  for  MMP-2  have  reduced  tumor  angiogenesis  also  supports  the 
idea  that  this  protease  has  a  causal  role  in  tumor  progression  (7).  MMP-9  is  also  implicated  in 
angiogenesis,  and  is  suggested  to  be  part  of  the  “angiogenic  switch”  that  enacts  the 
vascularization  of  tumors  (8).  Despite  the  fact  that  both  MMP-2  and  MMP-9  are  linked  to 
angiogenesis,  there  are  strong  indications  that  their  mechanisms  of  action  are  distinct.  There  are 
circumstances  where  both  proteases  are  present  within  tumors,  but  only  one  participates  in  the 
angiogenic  switch  (8).  At  present  there  is  no  mechanistic  explanation  for  this  distinction,  but  it 
raises  the  possibility  that  the  two  MMPs  operate  by  cleaving  distinct  substrates. 

The  factors  that  govern  substrate  recognition  and  substrate  distinction  by  the  MMPs  have  not 
been  fully  elucidated.  The  structural  features  of  the  catalytic  clefts  of  all  MMPs  are  generally 
similar.  All  of  the  catalytic  clefts  contain  a  zinc  ion  and  glutamic  acid  residue  involved  in 
catalysis  (9).  Furthermore,  the  MMPs  contain  a  deep  Su  pocket  (10-15).  This  subsite  has  been 
exploited  as  a  docking  point  for  the  vast  majority  of  pharmaceutical  inhibitors  of  the  MMPs. 
Perhaps  not  surprisingly  then,  many  of  these  antagonists  are  broad  spectrum  inhibitors  (16),  and 
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show  evidence  of  unwanted  side  effects  (17,18).  There  is  also  evidence  to  indicate  that  the 
common  structural  features  of  the  catalytic  cleft  lead  to  an  overlap  in  substrate  recognition  (19). 
The  vast  majority  of  known  MMP  substrates  contain  a  large  hydrophobic  residue,  which  is 
frequently  leucine,  at  the  Pi-  position.  This  is  consistent  with  the  deep  Si-  pocket.  Further 
similarity  is  observed  at  the  P3  position,  where  proline  is  often  preferred.  In  fact,  the  P-X-X^XHy 
motif  appears  to  be  an  excellent  substrate  for  a  wide  range  of  MMPs  (20-22).  However,  recent 
work  counters  the  notion  that  MMPs  have  an  overlapping  substrate  recognition  profile.  Recent 
studies  on  both  MMP- 13  and  MMP-9  show  that  a  high  degree  of  selectivity  can  be  obtained  for 
individual  MMPs,  even  among  substrates  comprised  of  the  P-X-XlXHy  motif  (21,22). 

Given  these  recent  observations,  we  wondered  whether  very  closely  related  MMPs  would  exhibit 
distinctions  in  substrate  recognition.  Thus,  we  focus  on  substrate  recognition  by  MMP-2  and  its 
closely  related  homolog,  MMP-9.  These  two  MMPs  are  unique  in  that  they  contain  three  type  II 
fibronectin  domains  intercalated  within  their  catalytic  domains.  Although  these  fibronectin 
domains  are  oriented  away  from  the  catalytic  cleft,  they  are  believed  to  mediate  docking 
interactions  with  substrates  like  gelatin  and  with  the  natural  inhibitors,  TIMPs.  Until  now  it  was 
assumed  that  MMP-2  and  MMP-9  have  overlapping  substrate  recognition  profiles.  Here  we  use 
an  unbiased  substrate  phage  approach  to  obtain  the  substrate  recognition  profile  of  MMP„-2.  We 
find  that  like  other  MMPs,  MMP-2  can  cleave  peptides  containing  the  P-X-X-XHy  sequence. 
However,  three  novel  substrate  motifs  were  also  identified.  These  novel  substrates  are  highly 
selective  for  MMP-2  over  MMP-9.  Consequently,  the  results  of  the  present  study  challenge  the 
idea  that  the  two  enzymes  are  functionally  similar  and  provide  a  potential  basis  for  their  distinct 
biological  roles. 
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Experimental  Procedures 

Source  of  Commercial  Proteins  and  Reagents 

MMP-7  (active  enzyme),  MMP-13  (pro-enzyme)  and  TIMP-2  were  purchased  from  Chemicon 
(Temecula,  Calif.).  Ilomastat  was  purchased  from  AMS  Scientific  (Concord,  Calif.).  Restriction 
enzymes  were  from  Roche  Bioseciences  or  New  England  Biolabs.  Oligonucleotides  were 
synthesized  by  Integrated  DNA  Technologies,  Inc  (Iowa).  Tissue  culture  media  and  reagents 
were  from  Irvine  Scientific  (Irvine,  Calif.).  All  other  reagents,  chemicals,  and  plastic  ware  were 
from  Sigma  or  Fisher. 

Construction  of  the  substrate  phage  library 

The  substrate  phage  library  used  in  this  study  was  generated  using  a  modified  version  of  the 
fUSE5  phagemid  (23),  as  we  have  previously  described  (22).  The  library’s  primary  features  are 
a  random  peptide  hexamer  on  the  N-terminus  of  the  gene  in  protein  and  a  FLAG  epitope 

g 

positioned  to  the  N-terminus  of  the  hexamer.  The  substrate  phage  library  represents  2.4  x  10 
individual  sequences,  ensuring  with  75%  confidence  that  all  possible  sequences  are  represented 
(23). 

Expression  and  purification  of  the  catalytic  domain  of  human  MMP-2  and  9 
The  cDNA  encoding  the  catalytic  domain  of  MMP-2  was  generated  by  PCR  and  cloned  into  the 
pCDNA3  expression  vector  (Invitrogen).  The  expression  vector  was  used  to  transfect  HEK  293 
cells.  Individual  antibiotic  resistant  colonies  were  isolated  with  cloning  rings,  expanded,  and 
then  screened  by  RT-PCR  for  MMP-2  mRNA.  Expression  of  MMP-2  protein  was  assessed  by 
zymography  of  conditioned  medium.  For  purification,  cells  were  grown  for  five  days  in  DMEM 


6 


with  10%  FBS.  The  catalytic  domains  of  MMP-2  and  9  were  purified  from  the  conditioned 
medium  bv  eelatin-Senharose  chromatography  as  described  previously  (24,25)  followed  by  ion 
exchange  on  O-senharose.  Fractions  containing  MMP-2  or  9  were  concentrated  in  a  dialysis  bag 
using  Aauacide  TT  tCalbinchemV  The  purity  of  MMP-2  and  9  was  greater  than  90%  judging  by 
silver  stained  acrylamide  gels.  The  nurified  MMP-2  and  9  were  stored  at  -70°C  at  concentrations 
ranging  from  0.4  -1.3mg/ml. 

Activation  and  active  site  titration  of  proteases 

MMPs  were  activated  by  2mM  p-Aminophenylmercuric  Acetate(APMA)  at  room  temperature  as 
previously  described  (24).  Although  APMA  may  be  a  less  efficient  wav  to  activate  MMP-2  arid 
Q  than  utilization  of  other  MMPs  as  it  leaves  a  bit  of  the  propeptide  (including  the  free  cysteine) 
still  attached,  we  chose  APMA  activation  to  avoid  contamination  of  other  proteases.  Besides  the 
presence  of  leftover  nronetide  is  less  likely  to  interfere  with  the  catalytic  activity  against  peptide 
substrates.  After  activation,  the  activities  of  MMP-2  and  MMP-9  were  titrated  using  the 
hydroxamate  inhibitor  Ilomastat  as  previously  described  (22,26).  The  active  sites  of  full  length 
MMP-13  and  MMP-7  were  titrated  with  human  TIMP-2.  Briefly,  5-15  nM  of  each  protease  was 
pre-incubated  with  a  range  of  TIMP-2  for  five  hours  at  room  temperature.  Residual  MMP-13 
activity  was  monitored  by  cleavage  of  MCA-Pro-Cha-Gly-Nva-His-Ala-Dpa-NH2  (Calbiochem). 
Residual  MMP-7  activity  was  monitored  by  cleavage  of  MCA-Pro-Leu-Gly-Leu-Dpa-Ala-Arg- 
NH2  (Calbiochem). 

Selection  of  MMP-2  substrates  from  the  phage  library 

The  substrate  phage  library  (2  x  1010  phage)  was  incubated  with  2.5  pg/ml  of  MMP-2  in  50  mM 
Tris,  pH  7.4,  100  mM  NaCl,  10  mM  CaCl2,  0.05%  Brij-35,  and  0.05%  BSA  for  1  hour  at  37°C. 
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A  control  reaction  was  performed  without  protease.  The  cleaved  phage  were  separated  from  the 
non-cleaved  phage  by  immuno-depletion.  100  pg  of  an  anti-FLAG  monoclonal  antibody  (Sigma) 
was  added  to  the  phage  samples  and  then  incubated  for  18  hours  with  rocking  at  4°C.  The  phage- 
antibody  complexes  were  twice  precipitated  by  the  addition  of  100  pi  Pansorbin  (Calbiochem). 
The  cleaved  phage  remaining  in  the  supernatant  were  amplified  using  K91  E.  coli ,  and  were 
then  used  for  one  additional  round  of  substrate  selection. 

Monitoring  phage  hydrolysis  by  ELISA 

Hydrolysis  of  individual  phage  substrates  was  measured  using  a  modified  ELISA  that  we  have 
previously  described  (22).  Briefly,  phage  from  overnight  cultures  were  captured  into  microtiter 
plates  coated  with  anti-M13  antibody  (Pharmacia,  2.5  pg/ml).  The  captured  phage  were 
incubated  with  MMP-2  (2.5  pg/ml)  in  Incubation  Buffer  (50  mM  Tris,  pH  7.5, 100  mM  NaCl,  10 
mM  CaCl2, 0.05%  BSA,  0.05%  Brij-35,  50  pM  ZnCl2)  for  2  hours  at  37°C.  Control  wells  lacked 
protease.  Following  hydrolysis  and  extensive  washing,  anti-FLAG  polyclonal  antibody  (1.8 
pg/ml  in  TBS-T  with  1  mg/ml  BSA)  was  added  to  the  wells  and  incubated  for  1  hr.  Following 
additional  washing,  the  level  of  bound  anti-FLAG  antibody  was  quantified  with  an  HRP- 
conjugated  goat  anti-rabbit  IgG  antibody  (BioRad)  followed  by  detection  at  490  nm.  The  extent 
of  hydrolysis  of  each  phage  was  calculated  by  the  ratio  of  the  O.D.  at  490  nm  of  the  pfotease- 
treated  samples  versus  samples  lacking  protease. 


Mapping  the  position  of  scissile  binds  within  peptide  substrates 

Peptides  representing  the  phage  inserts  were  synthesized  to  our  specifications  by  Annaspec  Inc.. 
The  N-termini  were  acetylated  and  C-termini  were  synthesized  as  amides.  The  cleavage  site 
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within  peptide  substrates  was  determined  using  MALDI-TOF  mass  spectrometry.  MMP-2  (25 
nM)  was  incubated  with  200  pM  of  each  peptide,  independently,  in  50  mM  Tris,  pH  7.5,  100 
mM  NaCl,  10  mM  CaCl2  for  2  hours  at  37°C.  The  mass  of  the  cleavage  products  was  determined 
using  a  Voyager  DE-RP  MALDI-TOF  mass  spectrometer  (PerSeptive  Biosysytems, 
Framingham,  MA).  Following  hydrolysis,  the  peptide  samples  were  prepared  for  MALDI 
analysis  according  to  methods  described  previously  (27),  and  subsites  within  the  peptide  were 
designated  according  to  the  nomenclature  of  Schechter  and  Berger  (28). 

Quantifying  the  kinetics  parameters  of  peptide  hydrolysis 

The  kinetic  parameters  of  substrate  hydrolysis  were  measured  using  a  fluorescamme 
incorporation  assay  that  has  been  previously  described  (29-32).  Briefly,  MMP-2,  MMP-9, 
MMP-7,  or  MMP-13  were  incubated  with  individual  peptide  substrates  at  concentrations  ranging 
from  100-800  pM  in  50  mM  Tris,  pH  7.5,  100  mM  NaCl,  10  mM  CaCl2,  50  pM  ZnCl2.  At 
selected  time  points  the  reactions  were  stopped  by  the  addition  of  1,10-phenanthroline.  Peptide 
hydrolysis  was  determined  by  the  addition  of  fluorescamine  followed  by  detection  at  X,ex  355  nm 
and  Xem  460  nm.  The  data  were  transformed  to  double  reciprocal  plots  (1/[S]  vs  1/Vj)  to 
determine  Km  and  kcat  (29-32).  For  some  substrates,  Km  and  kcat  could  not  be  determined 
individually,  but  the  specificity  constant,  kcJKm,  was  derived  by  the  equation:  kcat/Km= 
Vi/(Eo)(So)  assuming  [S]  is  significantly  lower  than  the  Km  (19). 

Assessing  cleavage  of  Recombinant  Eph  receptors  by  MMP-2  and  MMP-9 

Recombinant  fusion  proteins  between  EphBl,  EphB2  and  the  Fc  domain  of  IgG  were  purchased 

from  R&D  Systems  Inc..  In  these  constructs,  the  extracellular  domain  of  rat  EphBl  (amino  acid 
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residues  1-538)  and  mouse  EphB2  (amino  acid  residues  1-548)  are  fused  to  the  Fc  region  of 
human  IgG  via  a  short  polypeptide  linker  (33).  The  fusion  proteins  (1 .8uM)  were  incubated  for  4 
hours  at  37°C  with  280  nM  of  either  MMP-2  or  MMP-9.  Following  incubation,  samples  were 
resolved  by  10%  SDS-PAGE,  and  samples  were  visualized  by  Coomassie  staining.  The  N- 
terminus  of  the  cleaved  Eph  B1  was  determined  by  automated  Edman  degradation  of  protein 
blotted  to  PVDF  membranes. 
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RESULTS 


Activated  MMP-2  was  used  to  select  optimal  substrates  from  a  phage  display  library  that  we 
have  previously  described  (22).  Following  two  rounds  of  phage  hydrolysis  and  subsequent 
amplification,  several  individual  phage  clones  were  selected  randomly  and  assessed  for 
hydrolysis  by  MMP-2.  The  phage  vector  encodes  a  FLAG  epitope  that  is  positioned  to  the  N- 
terminal  side  of  the  randomized  peptide  hexamer.  Therefore,  cleavage  within  the  hexamer  by 
MMP-2  was  gauged  by  measuring  the  liberation  of  the  FLAG  epitope  using  an  ELISA.  Thirty 
individual  clones  were  selected  for  sequencing  based  on  the  fact  that  they  were  cleaved  by  more 
than  25%  when  incubated  for  two  hours  with  activated  MMP-2  (60  nM).  Four  distinct  groups  of 
substrates  are  found  among  these  clones  (Table  I). 

The  first  group  of  substrates  contains  the  P-X-X-XHy  motif,  where  XHy  represents  a  large 
hydrophobic  residue.  This  motif  appears  to  be  a  substrate  for  a  number  of  different  MMPs  (20- 
22).  Substrates  in  group  II-IV  represent  novel  recognition  motifs  for  MMP-2.  Group  II 
substrates  contain  the  I/L-X-X-XHy  motif,  in  which  the  last  hydrophobic  residue  is  usually  lie  or 
Leu.  Group  in  substrates  contain  the  XHy-S-X-L  motif,  where  Ser  and  Leu  are  invariant.  Group 
IV  substrates  are  comprised  of  the  H-X-X-XHy  motif,  which  is  similar  to  the  cleavage  site  for 
MMP-2  within  laminin-5  (34). 

Assessing  the  selectivity  of  the  MMP-2  Substrates. 

We  compared  the  rate  of  hydrolysis  of  a  set  of  representative  phage  substrates  by  MMP-2  and 
MMP-9  (Figure  1).  With  the  exception  of  one  clone,  A45,  the  phage  from  group  I  lacked 
selectivity  for  MMP-2  over  MMP-9.  In  contrast  however,  all  of  the  phage  substrates  from 
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groups  II  and  III  were  highly  selective  for  MMP-2  over  MMP-9.  The  extent  of  hydrolysis  of 
phage  substrates  from  group  IV  was  not  compared  using  this  assay  because  their  rate  of 
hydrolysis  by  MMP-2  was  generally  low.  These  substrates  were  characterized  in  more  detail 
with  the  aid  of  synthetic  peptides  (see  below). 

Characterization  of  Synthetic  Peptide  Substrates 

Representative  peptides  from  groups  I-IV  were  synthesized  and  used  to  characterize  substrate 
hydrolysis  in  greater  detail.  The  peptides  were  initially  used  to  determine  the  position  of  the 
scissile  bond  within  each  motif  by  analyzing  the  cleavage  products  by  MALDI-TOF  mass 
spectrometry  (Table  II).  In  virtually  all  cases,  the  scissile  bond  directly  precedes  a  large 
hydrophobic  residue,  which  is  frequently  lie  or  Leu  (Table  I).  This  feature  is  consistent  with  the 
presence  of  a  deep  binding  pocket  at  the  corresponding  Sr  subsite  within  MMP-2  (14). 

The  synthetic  peptide  substrates  were  also  used  to  gain  more  detailed  information  on  the 
selectivity  of  the  substrate  motifs  for  individual  MMPs.  We  compared  the  rate  of  hydrolysis  of 
each  peptide  by  MMP-2,  MMP-9,  MMP-7  and  MMP-13.  Each  enzyme  was  quantified  by  active 
site  titration  to  ensure  that  accurate  comparisons  were  obtained.  The  initial  velocity  of 
hydrolysis  was  measured  across  a  range  of  peptide  concentrations  and  reciprocal  plots  were  used 
to  derive  Km  and  kcat  for  MMP-2,  and  the  kcJKm  ratio  for  each  enzyme  (Table  III).  A  double 
reciprocal  plot  of  B74  peptide  hydrolysis  serves  as  an  illustration  of  how  Km.  and  kmt  for  MMP-2 
were  derived  tfigure  31.  When  cleaved  by  MMP-2,  most  of  the  substrates  exhibited  Michaelis 
constants  in  the  mM  range,  with  turnover  rates  ranging  from  1  to  750  s'1.  Correspondingly,  the 
kcJKm  ratios  were  all  between  1.6  x  104  M'V1  and  2  x  105  M'V1.  Most  significantly,  substrates 
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from  groups  II,  El  and  IV  exhibit  kcJKm  ratios  that  are  eight  to  three  hundred  and  fifty-fold 
higher  for  MMP-2  than  for  the  other  MMPs.  As  a  group,  the  substrates  within  group  II  show  the 
most  selectivity.  The  selectivity  of  these  substrates  for  MMP-2  is  conveniently  illustrated  by  the 
ratio  of  kcatIKm  for  MMP-2  divided  by  the  same  value  for  the  other  MMPs  (Table  IV). 

The  Role  of  Residues  at  P3  and  P2  in  Conferring  Substrate  Selectivity 

Several  experiments  were  conducted  to  gain  a  better  understanding  of  the  structural  basis  for  the 
selectivity  of  the  substrates  in  group  II-IV.  Two  hypotheses  were  tested.  The  first  centered  on 
the  fact  that  neither  group  II  nor  group  III  substrates  contain  a  Pro  at  the  P3  position.  Since  this 
residue  is  frequently  found  in  substrates  for  other  MMPs,  and  is  also  present  at  this  position  in 
group  I  substrates,  we  reasoned  that  its  absence  may  be  a  defining  feature  of  the  selective 
substrates.  To  test  this  hypotheses,  variants  of  three  peptides  from  groups  II  and  III  were 
synthesized  to  contain  a  Pro  at  P3,  and  their  hydrolysis  by  each  MMP  was  measured  (Table  III). 
Interestingly,  the  substitution  of  Pro  into  the  P3  position  of  peptide  B74  and  A13  increased  the 
kcat/Km  ratio  for  all  MMPs.  However.  Pro  substitution  did  not  reduce  the  selectivity  exhibited  by 
the  nentides  completely.  Hence,  the  lack  of  Pro  at  P^  may  partially  involve  in  determining  the 
selectivity  but  is  not  the  nrimarv  determinant  in  the  selectivity  of  these  substrates  for  MMP-2 
because  nentides  with  Pro  substitution  still  confer  good  selectivity  to  MMP-2  over  MMP-9_Q5 
fold!. 

The  second  hypothesis  was  based  on  our  recent  characterization  of  the  substrate  recognition 
profile  of  MMP-9  (22).  That  study  revealed  a  critical  role  or  preference  for  Arg  at  the  P2 
position.  Since  Arg  is  rarely  present  at  P2  in  the  substrates  selected  for  MMP-2,  we  hypothesized 
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that  substitution  of  Arg  into  P2  might  shift  selectivity  away  from  MMP-2  and  toward  MMP-9. 
Indeed,  in  the  three  peptides  tested,  B74,  A13  and  C9,  the  substitution  of  Arg  into  the  P2  position 
dramatically  increased  hydrolysis  by  MMP-9.  This  substitution  also  significantly  decreased 
hydrolysis  by  MMP-2.  In  combination,  these  effects  completely  switch  the  selectivity  ratio  of 
the  mutated  peptides  (Table  IV).  These  findings  underscore  the  significance  of  Arg  at  P2  in 
facilitating  substrate  recognition  by  MMP-9,  and  also  point  to  the  important  role  of  the  S2  subsite 
in  distinguishing  the  activity  of  MMP-2  and  MMP-9.  Interestingly,  the  substitution  of  Arg  at  P2 
had  minimal  effects  on  the  kcat/Km  ratio  of  MMP-7  or  MMP-13  (Table  III),  indicating  that  other 
features,  that  are  still  not  understood,  confer  selectivity  of  these  peptides  for  MMP-2  over  MMP- 
7  and  MMP-13. 

We  noticed  that  the  sequence  of  substrate  A21  is  similar  to  an  MMP-2  selective  cleavage  site 
within  laminin-5  (34).  Peptide  A21  has  a  kcJKm  ratio  that  is  eight-fold  higher  for  MMP-2  over 
MMP-9.  Interestingly  though,  it  contains  a  rather  large  residue,  Lys,  at  P2.  Because  smaller 
residues  are  favored  at  P2  by  MMP-2,  we  mutated  this  Lys  to  Ala  as  an  additional  test  of  the  P2 
residue  in  conferring  selectivity.  In  addition,  this  mutation  makes  the  sequence  of  the  peptide 
match  more  closely  with  the  cleavage  site  within  laminin-5,  which  contains  the  HAAL  sequence 
(34).  The  mutated  peptide  is  hydrolyzed  by  MMP-2  more  than  ten-fold  better  than  the  A21 
parent  peptide.  This  mutation  was  without  significant  effect  on  hydrolysis  by  MMP-9.  These 
observations  support  the  idea  that  the  P2/S2  interaction  is  key  to  distinguishing  substrate 
recognition  by  MMP-2  and  MMP-9. 

Selective  Hydrolysis  of  A  Protein  Substrate  Containing  the  S-X-L  Motif. 
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Ultimately,  one  would  hope  to  be  able  to  use  the  substrate  recognition  profiles  obtained  from 
substrate  phage,  and  other  substrate  profiling  strategies  (35,36),  to  generate  hypotheses  regarding 
physiologic  substrates.  However,  there  have  not  been  enough  test  cases  to  establish  the  rules  for 
such  extrapolations.  As  an  initial  step  in  this  direction,  we  compared  the  ability  of  MMP-2  and 
MMP-9  to  cleave  Eph  B1  and  Eph  B2.  Enh  B1  and  B2  are  tyrosine  kinase  receptors  that  are 
resnonsible  for  cell-cell  signaling  during  neuronal  development.  These  proteins  contain  putative 
cleavage  sites  that  correspond  rather  closely  to  the  substrates  selected  from  the  phage  library  by 
MMP-2.  There  are  two  potential  cleavage  sites  within  Eph  Bl.  The  first  motif  contains  the 
sequence  is  S-I-S-S-L-W,  which  matches  well  with  the  XHy-S-X^L  motif  of  the  group  HI 
substrates.  This  motif  is  positioned  within  a  predicted  p-strand  within  a  fibronectin  repeat  in  Eph 
Bl  (37).  A  second  potential  cleavage  site  with  a  sequence  of  K-S-E-L,  is  located  in  a  ten  residue 
linker  between  the  membrane  spanning  segment  and  the  second  type  III  repeat  of  Eph  Bl.  This 
sequence  does  not  precisely  match  the  motif  of  the  group  IH  substrates,  but  it  does  contain  the 
core  S-X-L  motif.  In  the  recombinant  form  of  Eph  Bl  used  here,  this  putative  cleavage  site  is 
positioned  in  a  short  segment  between  the  second  fibronectin  type  III  repeat,  and  the  Fc  domain 
of  IgG.  Eph  B2  contains  only  a  single  putative  cleavage  site,  with  a  sequence  of  is  Y-I-S-D^L- 
L.  This  motif  is  positioned  in  a  predicted  p-strand  in  the  second  type  III  fibronectin  repeat  and  it 
corresponds  to  the  XHy-S-X^L  motif  of  group  III  substrates.  EphB2  lacks  the  second  predicted 
cleavage  site,  even  though  the  recombinant  protein  contains  an  analogous  linker. 

The  Eph  Bl  and  Eph  B2  fusion  proteins  were  incubated  with  equimolar  amounts  (280  nM)  of 
MMP-2  and  MMP-9  for  four  hours.  The  extent  of  hydrolysis  was  gauged  by  SDS-PAGE  (Figure 
2).  The  Eph  Bl-Fc  fusion  protein  was  almost  quantitatively  cleaved  by  MMP-2  (Figure  2,  lane 
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2).  The  extent  of  cleavage  by  MMP-9  was  far  lower  (Figure  2,  Lane  3),  which  is  not  surprising 
considering  the  selectivity  exhibited  by  the  group  II  substrates.  Neither  protease  cleaved  the  Eph 
B2  fusion  protein.  The  site  of  hydrolysis  within  Eph  B1  was  determined  by  sequencing  the  N- 
terminus  of  one  of  the  released  fragments.  The  amino  acid  sequence  of  this  fragment  indicates 
that  the  protein  was  cleaved  at  the  sequence  D-D-Y-K-S-E  i  L-R-E,  that  is  found  within  the  ten 
residue  linker  of  Eph  Bl.  This  is  one  of  the  predicted  cleavage  sites  in  EphBl.  These  findings 
illustrate  that  motifs  found  to  be  selective  for  MMP-2  by  substrate  phage  display,  can  act  as 
selective  substrates  within  the  context  of  whole  proteins.  Equally  as  important  however,  this 
experiment  illustrates  that  the  three  dimensional  conformation  of  the  putative  cleavage  site  will, 
to  a  large  degree,  control  the  extent  of  hydrolysis.  Therefore,  we  conclude  that  meaningful 
genome-wide  predictions  on  putative  physiologic  substrates  will  require  the  incorporation  of 
additional  information  and  computational  filters  (see  Discussion). 
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Discussion 


Despite  their  high  structural  homology,  MMP-2  and  MMP-9  are  reported  to  have  distinct 
biological  roles.  For  example,  in  tumors  in  the  RIP-Tag  mouse,  where  both  proteases  are 
present,  only  MMP-9  seems  to  be  causally  involved  in  the  angiogenic  switch  of  these  tumors  (8). 
Similarly,  in  the  process  of  platelet  aggregation,  MMP-2  promotes  aggregation,  but  MMP-9 
inhibits  aggregation  (38).  The  reasons  for  these  differences  in  function  are  not  clear.  One 
possible  explanation  could  be  that  the  two  enzymes  have  distinct  substrate  recognition  profiles. 
Until  now  however,  there  has  been  no  systematic  and  unbiased  comparison  of  these  profiles. 
Here,  we  defined  the  substrate  recognition  profile  of  MMP-2,  and  compared  it  to  that  of  the 
closely  related  MMP-9.  The  findings  of  the  study  show  that  although  two  MMPs  can  cleave  a 
common  set  of  substrates  with  the  P-X-X-XHy  sequence,  MMP-2  hydrolyzes  an  additional  array 
of  peptide  substrates.  These  observations  support  the  idea  that  differences  in  biological  function 
of  MMP-2  and  MMP-9  could  stem  from  their  action  on  distinct  physiologic  substrates. 

We  found  that  like  other  MMPs,  MMP-2  efficiently  cleaves  peptide  substrates  that  contain  the  P- 
X-X-XHy  motif.  This  observation  is  consistent  with  the  idea  that  this  motif  is  recognized 
universally  by  the  MMP  family.  The  sequence  of  this  motif  is  generally  complementary  with  the 
structural  features  within  the  catalytic  clefts  of  the  MMPs,  including  a  deep  Si*  binding  pocket, 
and  a  general  deviation  from  linearity  extending  from  the  S3  position,  which  is  occupied  by  the 
invariant  Pro  in  this  group  of  substrates.  Even  within  the  context  of  this  commonly  recognized 
motif,  some  substrate  selectivity  is  evident.  Substrates  for  MMP- 13  that  contain  this  motif 
exhibit  good  selectivity  for  MMP-13  over  three  other  MMPs  (21).  Similarly,  the  MMP-9 
substrates  with  this  motif  that  we  recently  characterized  are  selective  for  MMP-9  over  MMP-7 
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and  MMP-13.  Interestingly,  those  substrates  were  found  to  be  cleaved  equally  well  by  MMP-9 
and  MMP-2  (data  not  shown).  Here  we  observed  one  substrate  with  the  P-X-X-Xny  sequence, 
phage  A45,  that  is  selective  for  MMP-2  over  MMP-9,  but  such  selectivity  among  the  two 
gelatinases  does  not  appear  to  be  a  general  property  of  substrates  containing  P-X-X-XHy. 

Three  of  the  novel  groups  of  substrates  for  MMP-2  were  characterized  in  detail  and  were  found 
to  be  selective  for  MMP-2  over  the  other  MMPs  tested.  These  substrates  contain  consensus 
motifs  of  L/I-X-X-XHy,  XHy-S-X-L,  and  H-X-X-XHy.  All  three  sets  of  substrates  have  Michaelis 
constants  in  the  low  mM  range,  and  turnover  rates  of  several  hundred  per  second.  They  also 
contain  large  hydrophobic  residues  at  the  Pr  position,  a  feature  that  is  consistent  with  the  depth 
of  the  Sr  pocket.  The  dominant  role  of  this  interaction  to  substrate  recognition  by  the  MMPs  is 
underscored  by  presence  of  a  hydrophobic  residues  at  this  position  in  virtually  all  substrates 
selected  for  MMPs  out  of  randomized  libraries  (20-22).  The  P3  residue  of  group  II  and  III 
substrates  also  has  a  hydrophobic  character.  In  group  II  this  position  is  an  invariant  lie  or  Leu. 
Interestingly,  an  invariant  Ser  at  P2  is  the  feature  that  distinguishes  substrates  in  group  II  and  III. 
The  group  IV  substrates  are  unique  in  that  the  P3  residue  is  occupied  by  histidine.  We  did  not 
identify  a  sufficient  number  of  substrates  within  this  group  to  arrive  at  a  consensus  at  other 
positions. 

The  findings  presented  here  also  begin  to  provide  a  structural  basis  for  substrate  selectivity 
between  MMP-2  and  MMP-9.  In  our  selection  of  substrates  for  MMP-9  (22),  we  found  Arg  to 
be  preferred  at  P2,  whereas  Arg  was  rarely  observed  at  this  position  in  the  substrates  selected  for 
MMP-2.  Rather,  the  MMP-2  substrates  frequently  display  a  relatively  small  residue  at  P2.  For 
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example,  this  position  occupied  by  an  invariant  Ser  in  the  group  III  substrates.  When  Arg  is 
substituted  into  the  P2  position  in  the  MMP-2  substrates,  a  marked  shift  in  substrate  recognition 
was  observed.  In  all  cases  the  rate  of  hydrolysis  by  MMP-2  was  decreased,  and  hydrolysis  by 
MMP-9  was  increased.  In  fact,  this  substitution  increased  the  kcat/Km  ratios  for  MMP-9  between 
25  and  60-fold.  Further  support  for  the  role  of  the  P2  residue  in  substrate  recognition  comes  from 
analysis  of  the  group  IV  substrate  A21,  which  contains  Lys  at  P2.  We  converted  this  Lys  to  Ala 
in  order  to  more  closely  mimic  the  natural  sequence  in  laminin-5,  which  is  known  to  be 
selectively  cleaved  by  MMP-2.  Substitution  of  this  Lys  to  Ala  dramatically  increased  the  kcJKm 
ratio  for  MMP-2,  providing  further  support  for  the  idea  that  the  S2  subsite  within  MMP-2  is 
sterically  hindered.  It  is  interesting  to  note  that  the  substitution  of  Arg  into  P2  of  the  substrate 
had  little  effect  on  the  already  poor  recognition  of  these  motifs  by  MMP-7  and  MMP-13.  Hence, 
the  primary  effect  of  substitutions  at  P2  is  a  switch  in  substrate  recognition  by  the  closely  related 
gelatinases. 

Analysis  of  the  structure  of  the  S2  subsite  within  MMP-2  (14)  and  MMP-9  reveals  a  potential 
structural  basis  for  this  distinction  in  substrate  recognition  at  the  P2  position  (Figure  4).  The  S2 
pocket  is  remarkably  similar  in  both  enzymes,  save  the  presence  of  Glu-412  in  MMP-2  which  is 
replaced  by  an  Asp  in  MMP-9.  Our  prior  docking  of  substrates  containing  Arg  at  P2  into  the 
catalytic  cleft  of  MMP-9  indicated  a  favorable  interaction  between  the  positively  charged 
guanidino  group  of  Arg  with  the  acidic  side  chain  of  Asp-410  (22).  Since  Glu  contains  an 
additional  methylene  group  in  its  side  chain,  Glu-412  would  be  expected  to  extend  further  into 
the  S2  pocket  of  MMP-2  (orange  arrow).  Thus  Glu-412  is  expected  to  occlude  the  S2  subsite,  and 
hinder  the  docking  of  substrates  which  contain  larger  residues,  like  Arg,  at  the  P2  position. 
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Another  distinction  between  the  two  proteases  is  observed  at  Ala- 196  of  MMP-2,  which  is 
replaced  by  Pro-193  in  MMP-9.  The  pyrrolidine  ring  of  Pro-193  of  MMP-9  extends  further  out 
into  the  S3  space.  This  Pro  could  potentially  interfere  with  docking  of  extended  residues  like  Leu 
and  lie  that  are  often  found  at  P3  in  the  MMP-2  substrates. 

Aside  from  the  differences  in  structure  within  the  catalytic  clefts  of  MMP-2  and  MMP-9,  we 
must  also  consider  the  possibility  that  these  proteases  can  assume  distinct  conformations  that  are 
not  revealed  by  the  existing  crystal  structure  (10-13,39)  and  models  (40).  This  possibility  is 
supported  by  the  fact  that  unbiased  searches  reveal  four  separate  sets  of  substrates  for  MMP-2 
(this  report)  and  three  families  of  substrates  for  MMP-9  (22).  Thus,  it  is  conceivable  that  the 
two  pockets,  although  containing  virtually  the  same  resides,  could  adopt  different  conformations 
because  of  constraints  imposed  at  ancillary  regions  of  the  protease.  Although  there  is  little  direct 
evidence  for  this  possibility,  it  does  provide  an  alternative  hypothesis  to  explain  their  differences 
in  substrate  recognition.  Such  conformational  modulation  of  the  catalytic  pocket  could  also 
relate  to  the  biology  behind  the  two  MMPs  because  their  binding  to  other  proteins,  like  the  avp3 
integral  (6),  could  conceivably  regulate  substrate  recognition.  One  might  also  hypothesize  that 
the  binding  of  either  protease  to  collagen  through  its  type  II  fibronectin  repeats  could  provide 
similar  conformational  regulation. 

Ideally,  one  would  like  to  use  the  information  obtained  from  substrate  profiling  approaches,  like 
the  one  reported  here,  and  elsewhere  (35,36),  to  predict  the  physiologic  and  pathophysiologic 
substrates  for  proteases.  Our  findings  indicate  that  such  predictive  strategies  have  merit  but  that 
they  will  require  additional  refinement.  For  example,  it  is  encouraging  that  the  differences  in  the 


20 


substrate  phage  profiles  for  MMP-2  and  MMP-9  are  reflected  by  the  selective  cleavage  of  Eph 
B1  by  MMP-2.  It  is  equally  encouraging  that  the  closely  related  homologue  Eph  B2,  which 
lacks  the  predicted  MMP-2  cleavage  site,  is  not  cleaved  by  MMP-2.  The  group  IV  substrates 
that  contain  the  H-X-X-XHy  motif  match  closely  with  the  selective  MMP-2  cleavage  site  in 
laminin-5  (sequence  of  H-A-A  i  L-T-S)  (34),  an  observation  that  provides  additional  support  for 
the  idea  that  substrate  profiling  could  ultimately  have  predictive  utility. 

However,  it  is  evident  that  if  they  are  to  be  applied  on  a  genome-wide  scale  these  predictive 
methods  require  further  refinement.  Even  though  the  sequences  within  Eph  B1  and  laminin-5 
are  similar  to  the  substrate  motifs  from  phage,  they  are  not  identical.  Consequently,  using  the 
phage  profiles  to  arrive  at  a  consensus  recognition  motif  that  is  based  on  the  physical  properties 
of  preferred  residues  (e.g.  large  hydrophobic  vs.  small  hydrophilic)  rather  than  actual  residues,  is 
worthy  of  exploration.  Building  from  here  though,  additional  filters  or  constraints  will  need  to 
be  applied  to  narrow  the  number  of  putative  substrates.  For  example,  one  could  limit  searches 
for  putative  substrates  to  proteins  expressed  in  the  appropriate  cellular  compartment  or 
extracellular  space.  Further  constraints  could  be  imposed  based  on  information  obtained  from 
gene  expression  profiling,  which  will  reveal  all  genes  that  are  co-expressed  with  any  given 
protease.  Our  findings  also  indicate  that  the  degree  to  which  a  predicted  cleavage  site  is  exposed 
to  solvent  should  be  taken  into  consideration.  Since  automated  medium  resolution  structural 
predictions  can  now  be  made  across  entire  genome  (41),  it  is  not  unreasonable  to  suggest  that 
this  information  could  be  used  as  an  additional  filter  to  identify  proteins  with  accessible  cleavage 
sites. 
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Footnotes: 


APMA  =  p-Aminophenylmercuric  Acetate 
BSA  =  Albumin,  Bovine 

MALDI-TOF  =  Matrix- Assisted  Laser  Desorption  Ionization  Time  of  Flight 
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Figure  Legends 

Figure  1.  Phage  substrates  are  selective  for  MMP-2  over  MMP-9. 

The  ability  of  MMP-2  (solid  bars)  and  MMP-9  (open  bars)  to  cleave  substrate  selected  from  the 
phage  library  were  compared  using  the  phage  ELISA  procedure  described  under  Methods. 
Immoblized  anti  Ml 3  antibody  was  used  to  capture  individual  phages  onto  96-well  microtiter 
wells.  The  captured  phage  were  cleaved  with  2.5ug/ml  of  MMP-2  or  MMP-9.  The  extent  of 
cleavage  within  the  phage  insert  was  assessed  by  measuring  the  release  of  the  FLAG  epitope. 
Results  are  presented  as  the  percentage  of  hydrolysis  compared  to  non-treated  control  phage. 
This  experiment  was  repeated  three  times,  yielding  nearly  identical  results  in  each  repetition. 


Figure  2.  EphBI  is  cleaved  bv  MMP-2  but  not  bv  MMP-9. 

The  ability  of  MMP-2  to  cleave  predicted  sites  within  EphBI  was  tested  using  a  recombinant 
fusion  protein  between  Eph  B1  and  the  Fc  domain  of  IgG.  The  Eph  Bl-Fc  fusion  protein  and  the 
corresponding  fusion  protein  encoding  the  Eph  B2  homologue  (1.8uM  of  each)  were  incubated 
for  4  hours  at  37°C  with  280nM  of  MMP-2  or  MMP-9.  Following  this  incubation,  samples  were 
resolved  by  10%  SDS-PAGE,  and  the  proteins  were  visualized  by  Coomassie  blue  staining.  The 
position  of  the  fragment  of  Eph  B1  generated  by  MMP-2  is  shown  by  an  arrow.  The 
recombinant  EphB2  migrates  at  80  kDa.  A  contaminant  is  present  at  lOOkd. 

Figure  3.  The  Double  Reciprocal  Plot  of  B74  Peptide  Cleaved  bv  MMP-2. 

The  initial  velocity  of  B74  peptide  hydrolysis  was  measured  by  incubating  12.5nM  of  active 
MMP-2  with  100,  200,  400,  and  800uM  of  peptide.  The  double  reciprocal  plot  of  1/[S]  vs.  1/V 
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was  then  generated  and  used  to  derive  an  equation  from  the  best-fit  line.  Value  of  Km  is  equal  to 
-1/X-intercept  and  value  of  kcat  is  equal  to  Vmax/[E][S]. 

Figure  4.  Model  of  the  Catalytic  Cleft  of  MMP-2  and  MMP-9. 

Models  of  the  catalytic  domain  of  MMP-2  (top  panel)  and  MMP-9  (bottom  panel)  were 
constructed  using  the  Modelled  software  from  Rockefeller  University  (42).  Models  were 
constructed  using  the  coordinates  of  the  catalytic  domain  of  MMP-2  from  the  reported  crystal 
structure  (14)(PDB  accession  number  lqibA).  The  model  of  MMP-9  is  based  on  the  crystal 
structure  of  MMP-2.  A  view  looking  into  the  catalytic  cleft  is  illustrated,  with  the  zinc  ion 
colored  orange.  The  side  chains  of  the  three  His  residues  that  ligand  with  zinc  are  evident.  The 
S2  and  S3  subsites  fall  within  the  cleft  to  the  left  of  the  zinc  ion.  Glu-412  protrudes  into  the  S2 
subsite  of  MMP-2  (orange  arrow).  In  MMP-9  the  corresponding  residue  is  Asp-410  (orange 
arrow).  One  other  notable  difference  among  the  two  proteases  is  Pro- 193  in  MMP-9,  whose 
pyrrolidine  ring  protrudes  away  from  the  upper  rim  of  the  catalytic  cleft  and  into  the  interface 
between  the  S2  and  S3  subsites  (white  arrow).  In  MMP-2  this  Pro  is  substituted  by  Ala,  creating 
a  relatively  unobstructed  surface  across  the  S3  space. 
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Table  I.  Sequences  of  phage  substrates  for  MMP-2.  Phage  from  the  second  round  of  selection 
were  screened  for  hydrolysis  by  MMP-2  using  an  ELISA.  Two  separate  experiments  were 
performed.  Based  on  this  result,  inserts  of  individual  phages  were  sequenced  to  obtain  the  amino 
acid  sequences  of  the  displayed  peptides.  The  substrates  are  separated  into  four  structural  motifs. 
Phage  are  also  ranked  according  to  the  extent  of  hydrolysis  by  MMP-2  at  2  hours,  (n.d.  =  not 
determined) 

_ Peptide  Sequence _ %  Hydrolysis 

Group _ P3  P2  Pi  Pr  P2'  P3' _ Clone _ expl _ exp2 

I  S  G  P  Y  V  I  W  L  A45  100  100 

R  P  P  M  A  K  T  A  A50  100  100 

S  G  P  V  W  Y  M  S  A26  100  98 

S  G  P  V  R  Y  M  V  A3 5  100  90 

G  L  P  R  W  L  L  T  A47  91  90 

G  R  P  I  R  M  K  T  A46  51  49 

II  L  R  L  A  A  I  T  A  B49  100  100 

I  YLGWATA  A29  100  100 

E  S  L  A  Y  Y  T  A  B74  81  90 

G  S  L  H  S  I  I  T  B54  72  75 

SDIRMLTA  B79  64  70 

FNLY  NLTA  B66  51  50 

G  Y  L  Q  V  L  L  T  B46  48  50 

S  G  I  V  N  L  Y  P  A43  38  40 

E  Y  L  H  M  R  T  A  A10  36  40 

VGLIAITA  A6  30  44 

III  RS  LSRLTA  C9  100  100 

NRYSSLTA  A34  100  100 

W  T  S  S  W  L  T  A  A31  100  100 

GAVSWLLT  A13  100  93 

P  M  I  S  V  L  T  A  A54  98  96 

A  N  I  S  D  L  T  A  B37  97  95 

T  I  L  S  L  L  T  A  B53  73  75 

G  F  N  S  M  L  K  T  C3  28  30 

IV  G  L  H  R  R  I  D  T  C5  60  n.d. 

GMHSRP  PT  A9  50  63 

H  M  H  K  A  L  T  A  A21  30  49 

G  R  H  L  G  L  Q  T  A4  30  45 

GLHKKVHT  C6  19  n.d. 

GAHAKHWT  D3  13  n.d. 
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Table  II.  Identification  of  the  position  of  scissile  bonds  within  MMP-2  substrates.  Representative 
peptides  were  synthesized  from  each  group  of  substrates.  Using  MALDI-TOF  mass  spectrometry  the 
position  of  scissile  bonds  was  determined  by  analyzing  the  mass  of  cleaved  peptide  fragments  generated 
by  MMP-2.  The  position  of  the  scissile  bond  is  noted  by  placing  the  Pi  and  Pr  residues  in  bold. 

Peptide  Sequences  Peptide  Mass  Observed  fragments  (Da) 


Group  I 

A3 

A 

K 

P3 

P 

P2 

R 

Pi 

A 

Pi 

L 

P2 

T 

P3' 

A 

1013.7 

727.9 

Group  II 

B49 

L 

R 

L 

A 

A 

I 

T 

A 

1013.8 

728.73 

B74 

E 

S 

L 

A 

Y 

Y 

T 

A 

1089 

804.27 

Group  III 

C9 

R 

S 

L 

S 

R 

L 

T 

A 

1103.4 

790.29“ 

A13 

G 

A 

V 

s 

W 

L 

L 

T 

1046.9 

670.06a 

B37 

A 

N 

I 

s 

D 

L 

T 

A 

1012.9 

729.03 

A34 

N 

R 

Y 

s 

S 

L 

T 

A 

1097.2 

812.41 

Group  IV 

A21 

H 

M 

H 

K 

A 

L 

T 

A 

1093.7 

809.61 

a  denotes  the  mass  of  Na+  adducts  of  the  peptide  fragments. 
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Table  III.  Measuring  peptide  hydrolysis  by  a  panel  ofMMPs.  The  hydrolysis  of  synthetic  peptides  by 
different  MMPs  was  quantified  using  procedures  outlined  in  Methods.  For  MMP-2,  hydrolysis  was 
measured  over  a  concentration  range  of  peptide.  Values  for  kcat  and  Km  were  derived  from 
Lineweaver-Burk  plots.  For  other  proteases  only  the  kcJKm  ratio  was  measured  (see  Methods). 
Additional  synthetic  peptides  were  synthesized  with  substitution  of  Pro  at  P3  and  Arg  at  P2  to  test  the 
influence  of  these  two  subsites  on  substrate  specificity.  Each  measurement  was  repeated  three  times 


and  the  relative  error  of  kcat/Km  in  these  measurements  was  less  or  equal  to  10%. 


MMP-2 

MMP-9 

MMP-7 

MMP-13 

Peptide 

Sequence 

kCat 

(s1) 

Km 

(mM) 

kcat  /Km 

(mV) 

kcat  /Km 
(mV) 

kcat  /Km 

(mV1) 

kcat  /Km 

(mV) 

Group  I 

A3 

SGAKPRA  l  LTA 

400 

3.6 

1.1E+05 

4.9E+04 

5.2E+03 

7.9E+03 

Group  II 

B49 

SGLRLAA  i  ITA 

510 

4.5 

1.1E+05 

7.9E+03 

1.9E+04 

8.5E+03 

B74 

SGESLAY  f  YTA 

622 

2.2 

2.8E+05 

2.6E+03 

7.9E+02 

2.0E+03 

B74P 

SGESPAY  i  YTA 

264 

0.6 

4.4E+05 

3.0E+04 

1.5E+03 

5.3E+03 

B74R 

SGESLRY  i  YTA 

61 

0.8 

7.6E+04 

1.0E+05 

7.1E+02 

1.1E+03 

C9 

SGRSLSR  1  LTA 

740 

4.4 

1.7E+05 

8.6E+02 

4.8E+03 

3.3E+03 

C9R 

SGRSLRR  i  LTA 

302 

6.4 

4.7E+04 

5.3E+04 

2.2E+03 

3.9E+03 

Group  III 

A13 

SGAVSW  i  LLTA 

105 

1.3 

8.1E+04 

5.4E+03 

6.4E+03 

2.9E+03 

A13P 

SGAPSW  i  LLTA 

204 

0.4 

5.1E+05 

1.7E+04 

5.4E+03 

4.9E+03 

A13R 

SGAVRW  i  LLTA 

28 

0.4 

7.0E+04 

2.0E+05 

3.5E+03 

2.9E+03 

B37 

SGANISD  i  LTA 

353 

2.4 

1.5E+05 

2.1E+03 

8.6E+03 

2.6E+03 

A34 

SGNRYSS  i  LTA 

202 

2.2 

9.2E+04 

2.3E+03 

1.1E+03 

3.9E+03 

Group  IV 

A21 

SGHMHKA  i  LTA 

124 

7.7 

1.6E+04 

2.1E+03 

n.d. 

n.d. 

A21A 

SGHMHAA  i  LTA 

601 

3.2 

1.9E+05 

1.9E+03 

n.d. 

n.d. 

n.d.=  not  determined 
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Table  IV.  Selectivity  of  individual  substrates  forMMP-2.  The  selectivity  of  each  of  the 
peptide  substrates  for  MMP-2  compared  to  other  MMPs  is  represented  by  dividing  the 
kcJKm  ratio  for  MMP-2  by  the  same  value  for  the  other  MMPs  that  were  tested.  In 
several  cases  mutation  of  the  P2  residue  gave  rise  to  dramatic  shifts  in  the  selectivity  ratio 


between  MMP-2  and  MMP-9.  These  values  are  in  bold  text. 


Selectivity  Ratio 

Peptide 

MMP-2/MMP-9 

MMP-2/MMP-7 

MMP-2/MMP- 1 3 

Group  I 
[P-X-X-XHy] 

A3 

2 

21 

14 

Group  II 

[I/L-Xpo-X-Xay] 

B49 

14 

6 

13 

B74 

108 

354 

140 

B74P 

15 

293 

83 

B74R 

0.8 

107 

69 

Group  III 
[XnyS-X-L] 

C9 

198 

35 

52 

C9R 

0.9 

21 

12 

A13 

15 

13 

28 

A13P 

30 

94 

104 

A13R 

0.4 

20 

24 

B37 

71 

17 

58 

A34 

40 

84 

24 

Group  IV 
[H-X-X-Xny] 

A21 

8 

n.d 

n.d. 

A21A 

100 

n.d. 

n.d. 

n.d.  =  not  determined 
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•  • 


Figure  2, 


EphBl 

EphB2 

MMP-2  + 

MMP-9  + 

”  + 

Kd 

172.6 
111.4 

79.6 
61.3 


49.0 


36.4 

24.7 
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